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I Preface - what a difference 21 years make! 

Twenty-one years after the publication of the l'"* edition of Applied Discrete Structures for Computer Science, in 1989 the publishing and 
computing landscape have both changed dramatically. We signed a contract for the second edition with Science Research Associates but by the 
time the book was ready to print, SRA had been sold to MacMillan. Soon after, the rights had been passed on to Pearson Education, Inc. In 
2010, the long-term future of printed textbooks is uncertain. In the meantime, textbook prices (both printed and e-books) have increased and a 
growing open source textbook market movement has started. One of our objectives in revisiting this text is to make it available to our students 
in an affordable format. In its original form, the text was peer-reviewed and was adopted for use at several universities throughout the country. 
For this reason, we see Applied Discrete Structures as not only an inexpensive alternative, but a high quality alternative. 

As indicated above the computing landscape is very different from the 1980's and accounts for the most significant changes in the text. One of 
the most common programming languages of the 1980's, Pascal; and we used it to illustrate many of the concepts in the text. Although it isn't 
totally dead, Pascal is far from the mainstream of computing in the 21*' century. In 1989, Mathematica had been out for less than a year — 
now a major force in scientific computing. The open source software movement also started in the 1980's and in 2005, the first version of 
Sage, an open-source alternative to Mathematica was first released. In Applied Discrete Structures we have replaced "Pascal Notes" with 
"Mathematica Notes" and "Sage Notes." Finally, 1989 was the year that World Wide Web was invented by Tim Berners-Lee. There wasn't a 
single www in the 2"'' edition. In this version, we intend to make use of extensive web resources, including video demonstrations. 
We would like to thank Tony Penta, Sitansu Mittra, and Dan Klain for using the preliminary versions of Applied Discrete Structures. The 
corrections and input they provided was appreciated. 

We repeat the preface to Applied Discrete Structures for Computer Science below. Plans for the instructor's guide, which is mentioned in the 
preface are imcertain at this time. 

I Preface to Applied Discrete Structures for Computer Science, 2nd Ed. 

We feel proud and fortunate that most authorities, including MAA and ACM, have settled on a discrete mathematics syllabus that is virtually 
identical to the contents of the first edition of Applied Discrete Structures for Computer Science. For that reason, very few topical changes 
needed to be made in this new edition, and the order of topics is almost unchanged. The main change is the addition of a large number of 
exercises at all levels. We have "fine-tuned" the contents by expanding the preliminary coverage of sets and combinatorics, and we have added 
a discussion of binary integer representation. We have also added an introduction including several examples, to provide motivation for those 
students who may find it reassuring to know that mathematics has "real" applications. "Appendix B— Introduction to Algorithms," has also 
been added to make the text more self-contained. 

How This Boole Will Help Students 

In writing this book, care was taken to use language and examples that gradually wean students from a simpleminded mechanical approach and 
move them toward mathematical maturity. We also recognize that many students who hesitate to ask for help from an instructor need a readable 
text, and we have tried to anticipate the questions that go unasked. 

The wide range of examples in the text are meant to augment the "favorite examples" that most instructors have for teaching the topics in 
discrete mathematics. 

To provide diagnostic help and encouragement, we have included solutions and/or hints to the odd-numbered exercises. These solutions include 
detailed answers whenever warranted and complete proofs, not just terse outlines of proofs. 

Our use of standard terminology and notation makes Applied Discrete Structures for Computer Science a valuable reference book for future 
courses. Although many advanced books have a short review of elementary topics, they caimot be complete. 

How This Book Will Help Instructors 

The text is divided into lecture-length sections, facilitating the organization of an instructor's presentation. 

Topics are presented in such a way that students' vmderstanding can be monitored through thought-provoking exercises. The exercises require 
an imderstanding of the topics and how they are interrelated, not just a familiarity with the key words. 

An Instructor's Guide is available to any instructor who uses the text. It includes: 

(a) Chapter-by-chapter comments on subtopics that emphasize the pitfalls to avoid; 

(b) Suggested coverage times; 

(c) Detailed solutions to most even-numbered exercises; 

(d) Sample quizzes, exams, and final exams. 
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How This Book Will Help the Chairperson/Coordinator 

The text covers the standard topics that all instructors must be aware of; therefore it is safe to adopt Applied Discrete Structures for Computer 

Science before an instructor has been selected. 

The breadth of topics covered allows for flexibility that may be needed due to last-minute curriculum changes. 

Since discrete mathematics is such a new course, faculty are often forced to teach the course without being completely familiar with it. An 
Instructor's Guide is an important feature for the new instructor. 

What a Difference Five Years Malces! 

In the last five years, much has taken place in regards to discrete mathematics. A review of these events is in order to see how they have 
affected the Second Edition of Applied Discrete Structures for Computer Science. 

(1) Scores of discrete mathematics texts have been published. Most texts in discrete mathematics can be classified as one-semester or two- 
semester texts. The two-semester texts, such as Applied Discrete Structures for Computer Science, differ in that the logical prerequisites for a 
more thorough study of discrete mathematics are developed. 

(2) Discrete mathematics has become more than just a computer science support course. Mathematics majors are being required to take it, often 
before calculus. Rather than reducing the significance of calculus, this recognizes that the material a student sees in a discrete mathematics/struc- 
tures course strengthens his or her understanding of the theoretical aspects of calculus. This is particularly important for today's students, since 
many high school courses in geometry stress mechanics as opposed to proofs. The typical college freshman is skill-oriented and does not have a 
high level of mathematical maturity. Discrete mathematics is also more typical of the higher-level courses that a mathematics major is likely to 
take. 

(3) Authorities such as MAA, ACM, and A. Ralson have all refined their ideas of what a discrete mathematics course should be. Instead of the 
chaos that characterized the early '80s, we now have some agreement, namely that discrete mathematics should be a course that develops 
mathematical maturity. 

(4) Computer science enrollments have leveled off and in some cases have declined. Some attribute this to the lay-offs that have taken place in 
the computer industry; but the amount of higher mathematics that is needed to advance in many areas of computer science has also discouraged 
many. A year of discrete mathematics is an important first step in overcoming a deficiency in mathematics. 

(5) The Educational Testing Service introduced its Advanced Placement Exam in Computer Science. The suggested preparation for this exam 
includes many discrete mathematics topics, such as trees, graphs, and recursion. This continues the trend toward offering discrete mathematics 
earlier in the overall curriculum. 

Acl<nowledgments 

The authors wish to thank our colleagues and students for their comments and assistance in writing and revising this text. Among those who 
have left their mark on this edition are Susan Assmaim, Shim Berkovitz, Tony Penta, Kevin Ryan, and Richard Winslow. 
We would also like to thank Jean Hutchings, Kathy SuUivan, and Michele Walsh for work that they did in typing this edition, and our depart- 
ment secretaries, Mrs. Lyn Misserville and Mrs. Danielle White, whose cooperation in numerous ways has been greatly appreciated. 
We are grateful for the response to the first edition from the faculty and students of over seventy-five colleges and universities. We know that 
our second edition will be a better leaming and teaching tool as a result of their useful comments and suggestions. Our special thanks to the 
following reviewers: David Buchthal, University of Akron; Ronald L. Davis, Millersville University; John W Kennedy, Pace University; Betty 
Mayfield, Hood College; Nancy Olmsted, Worcester State College; and Pradip Shrimani, Southern Illinois University. Finally, it has been a 
pleasure to work with Nancy Osman, our acquisitions editor, David Morrow, our development editor, and the entire staff at SRA. 

A.W. D. 
K.M.L. 
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What Is Discrete Mathematics/Structures? 

What is Discrete Mathiematics? 

As a general description one could say that discrete mathematics is the mathematics that deals with "separated" or discrete sets of objects rather 
than with continuous sets such as the real line. For example, the graphs that we learn to draw in high school are of continuous functions. Even 
though we might have begun by plotting discrete points on the plane, we connected them with a smooth, continuous, unbroken curve to form a 
straight line, parabola, circle, etc. The underlying reason for this is that hand methods of calculation are too laborious to handle huge amounts of 
discrete data. The computer has changed all of this. 

Today, the area of mathematics that is broadly called "discrete" is that which professionals feel is essential for people who use the computer as 
a fundamental tool. It can best be described by looking at our Table of Contents. It involves topics like sets, logic, and matrices that students 
may be already familiar with to some degree. In this Introduction, we give several examples of the types of problems a student will be able to 
solve as a result of taking this course. The intent of this Introduction is to provide an overview of the text. Students should read the examples 
through once and then move on to Chapter One. After completing their study of discrete mathematics, they should read them over again. 

We hope discrete mathematics is as fascinating and enjoyable to the student as it has been to us. 

Example l.a. Analog-to-digital Conversion. A common problem encountered in engineering is that of analog-to-digital (a-d) conversion, where 
the reading on a dial, for example, must be converted to a numerical value. In order for this conversion to be done reliably and quickly, one 
must solve an interesting problem in graph theory. Before this problem is posed, we will make the connection between a-d conversion and the 
graph problem using a simple example. Suppose a dial in a video game can be turned in any direction, and that the positions will be converted 
to one of the numbers zero through seven in the following way. As depicted in Figure I.a.l , the angles from to 360 are divided into eight equal 
parts, and each part is assigned a number starting with and increasing clockwise. If the dial points in any of these sectors the conversion is to 
the number of that sector. If the dial is on the boundary, then we will be satisfied with the conversion to either of the numbers in the bordering 
sectors. This conversion can be thought of as giving an approximate angle of the dial, for if the dial is in sector k, then the angle that the dial 
makes with east is approximately 45 k° . 




FIGURE l.a.1 



Now that the desired conversion has been described, we will describe a "solution" that has one major error in it, and then identify how this 
problem can be rectified. All digital computers represent numbers in binary form, as a sequence of Os and Is called bits, short for binary digits. 
The binary representations of numbers through 7 are: 






= 000 


= 0x4 


+ 


0x2 


-1- 


Ox 


1 


= 001 


= 0x4 


+ 


0x2 


-1- 


1 x 


2 


= 010 


= 0x4 


+ 


1 x2 


-1- 


Ox 


3 


= oil 


= 0x4 


+ 


1x2 


-1- 


1 x 
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= 100 


= 1x4 


+ 


0x2 


-1- 


Ox 


5 


= 101 


= 1x4 


+ 


0x2 


-1- 


1 X 


6 


= no 


= 1x4 


+ 


1 x2 


-1- 


Ox 


7 


= 111 


= 1x4 


+ 


1 x2 


-1- 


1 x 



We will discuss the binary number system in Chapter 1 . The way that we could send those bits to a computer is by coating parts of the back of 
the dial with a metallic substance, as in Figure I.a.2. For each of the three concentric circles on the dial there is a small magnet. If a magnet lies 
under a part of the dial that has been coated with metal, then it will turn a switch ON, whereas the switch stays OFF when no metal is detected 
above a magnet. Notice how every ON/OFF combination of the three switches is possible given the way the back of the dial is coated. 

If the dial is placed so that the magnets are in the middle of a sector, we expect this method to work well. There is a problem on certain 
boundaries, however. If the dial is turned so that the magnets are between sectors three and four, for example, then it is unclear what the result 
will be. This is due to the fact that each magnet will have only a fraction of the required metal above it to turn its switch ON. Due to expected 
irregularities in the coating of the dial, we can be safe in saying that for each switch either ON or OFF could be the result, and so if the dial is 
between sectors three and four, any number could be indicated. This problem does not occur between every sector. For example, between 
sectors and 1, there is only one switch that cannot be predicted. No matter what the outcome is for the units switch in this case, the indicated 
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sector must be either or 1 , which is consistent with the original objective that a positioning of the dial on a boundary of two sectors should 
produce the number of either sector. 




-vutunl 
I I - ilal coatBd 



■■wrdKte- Units M 
titSat c'lKii - Tw(K Hi 
thJltr rink - Rmwj bii 




FIGURE l.a.2 



Is there a way to coat the sectors on the back of the dial so that each of the eight patterns corresponding to the numbers to 7 appears once, and 
so that between any two adjacent sectors there is only one switch that will have a questionable setting? One way of trying to answer this 
question is by using an undirected graph called the 3-cube (Figure I.a.3). In general, an undirected graph consists of vertices (the circled O's and 
I's in the 3-cube) and the edges, which are lines that connect certain pairs of vertices. Two vertices in the 3-cube are connected by an edge if the 
sequences of the three bits differ in exactly one position. If one could draw a path along the edges in the 3-cube that starts at any vertex, passes 
through every other vertex once, and returns to the start, then that sequence of bit patterns can be used to coat the back of the dial so that 
between every sector there is only one questionable switch. Such a path is not difficult to find; so we will leave it to you to find one, starting at 
000 and drawing the sequence in which the dial would be coated. 




FIGURE I.a.3 



Many A-D conversion problems require many more sectors and switches than this example, and the same kinds of problems can occur. The 
solution would be to find a path within a much larger yet similar graph. For example, there might be 1 ,024 sectors with 10 switches, resulting in 
a graph with 1,024 vertices. One of the objectives of this text will be to train you to understand the thought processes that are needed to attack 
such large problems. In Chapter 9 we will take a closer look at graph theory and discuss some of its applications. 

One question might come to mind at this point. If the coating of the dial is no longer as it is in Figure I.a.2, how would you interpret the patterns 
that are on the back of the dial as numbers from to 7? In Chapter 14 we will see that if a certain path is used, this "decoding" is quite easy. 

The 3-cube and its generalization, the n-cube, play a role in the design of a multiprocessor called a hypercube. A multiprocessor is a computer 
that consists of several independent processors that can operate simultaneously and are connected to one another by a network of connections. 
In a hypercube with M = 2" processors, the processors are numbered to M - 1. Two processors are connected if their binary representations 
differ in exactly one bit. The hypercube has proven to be the best possible network for certain problems requiring the use of a "supercomputer." 
Denning's article in the May-June 1987 issue of "American Scientist" provides an excellent survey of this topic. 

Example I.b. Logic is the cornerstone of all communication, whether we wish to communicate in mathematics or in any other language. It is 
the study of sentences, or propositions, that take on the values true or false, 1 or in the binary system. Its importance was recognized in the 
very early days of the development of logic (hardware) design, where Boolean algebra, the algebra of logic, was used to simplify electronic 
circuitry called gate diagrams. Consider the following gate diagram: 




FIGURE l.b.1 



Each symbol in this diagram is called a gate, a piece of hardware. In Chapter 13 we will discuss these circuits in detail. Assume that this 
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circuitry can be placed on a chip which will have a cost dependent on the number of gates involved. A classic problem in logic design is to try 
to simplify this circuitry to one containing fewer gates. Indeed, the gate diagram can be reduced to 




FIGURE l.b.2 



The result is a less costly chip. Since a company making computers uses millions of chips, we have saved a substantial amount of money. 

This use of logic is only the "tip of the iceberg." The importance of logic for computer scientists in particular, and for all people who use 
mathematics, cannot be overestimated. It is the means by which we can think and write clearly and precisely. Logic is used in writing algo- 
rithms, in testing the correctness of programs, and in other areas of computer science. 

Example I.c. Suppose two students miss a class on a certain day and borrow the class notes in order to obtain copies. If one of them copies the 
notes by hand and the other walks to a "copy shop," we might ask which method is more efficient. To keep things simple, we will only consider 
the time spent in copying, not the cost. We add a few more assumptions: copying the first page by hand takes one minute and forty seconds 
(100 seconds); for each page copied by hand, the next page will take five more seconds to copy, so that it takes 1:45 to copy the second page, 
1:50 to copy the third page, etc.; photocopiers take five seconds to copy one page; walking to the "copy shop" takes ten minutes, each way. 

One aspect of the problem that we have not specified is the number of pages to be copied. Suppose the number of pages is n, which could be 
any positive integer. As with many questions of efficiency, one method is not clearly better than the other for all cases. Since the only variable 
in this problem is the number of pages, we can simply compare the copying times for different values of n. We will denote the time it takes (in 
seconds) to copy n pages manually by ?;,(«), and the time to copy n pages automatically by J„(n). Ideally, we would like to have formulas to 
represent the values of J;,(n) and ?„(«). The process of finding these formulas is an important one that we will examine in Chapter 8. The 
formula for tjn) is not very difficult to derive from the given information. To copy pages automatically, one must walk for twenty minutes 
(1 ,200 seconds), and then for each page wait five seconds. Therefore, t^in) = 1200 + 5n 

The formula for ?,,(«) isn't quite as simple. First, let p(n) be the number of seconds that it takes to copy page n. From the assumptions, 
p(l) = 100, and if n is greater than one, p{n) = p(n - \) + 5. The last formula is called a recurrence relation. We will spend quite a bit of 
time discussing methods for deriving formulas from recurrence relations. In this case p (n) = 95 + 5 n. Now we can see that if n is greater than 
one, 

?,,(«) = p{\) + p(2) + ■■■+ p{n) = ti,in - 1) + p{n) = t^(n - 1) + 5 n + 95 
This is yet another recurrence relation. The solution to this one is tf^in) = 97.5 « + 2.5 « . 

Now that we have these formulas, we can analyze them to determine the values of n for which hand copying is most efficient, the values for 
which photocopying is most efficient, and also the values for which the two methods require the same amount of time. 

WHAT IS DISCRETE STRUCTURES? 

So far we have given you several examples of that area of mathematics called discrete mathematics. Where does the "structures" part of the title 
come from? We will look not only at the topics of discrete mathematics but at the structure of these topics. If two people were to explain a 
single concept, one in German and one in French, we as observers might at first think they were expressing two different ideas, rather than the 
same idea in two different languages. In mathematics we would like to be able to make the same distinction. Also, when we come upon a new 
mathematical structure, say the algebra of sets, we would like to be able to determine how workable it will be. How do we do this? We compare 
it to something we know, namely elementary algebra, the algebra of numbers. When we encounter a new algebra we ask ourselves how similar 
it is to elementary algebra. What are the similarities and the dissimilarities? When we know the answers to these questions we can use our vast 
knowledge of basic algebra to build upon rather than learning each individual concept from the beginning. 
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chapter 1 

B 

SET THEORY I 



GOALS 

In this chapter we will cover some of the basic set language and notation that will be used throughout the text. Venn diagrams will be intro- 
duced in order to give the reader a clear picture of set operations. In addition, we will describe the binary representation of positive integers 
(Section 1.4) and introduce summation notation and its generalizations (Section 1.5). 
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Chapter 1 - Set Theory 



1 1.1 Set Notation and Relations 

The term set is intuitively understood by most people to mean a collection of objects that are called elements (of the set). This concept is the 
starting point on which we will build more complex ideas, much as in-geometry where the concepts of point and line are left undefined. 

Because a set is such a simple notion, you may be surprised to learn that it is one of the most difficult concepts for mathematicians to define to 
their own liking. For example, the description above is not a proper definition because it requires the definition of a collection. (How would you 
define "collection"?) Even deeper problems arise when you consider the possibility that a set could contain itself. Although these problems are 
of real concern to some mathematicians, they will not be of any concern to us. 

Our first concern will be how to describe a set; that is, how do we most conveniently describe a set and the elements that are in it? If we are 
going to discuss a set for any length of time, we usually give it a name in the form of a capital letter (or occasionally some other symbol). In 
discussing set A, if x is an element of A, then we will write x e A. On the other hand, if x is not an element of A, we write x ^ A. The most 
convenient way of describing the elements of a set will vary depending on the specific set. 

Method 1: Enumeration. When the elements of a set are enumerated (or listed) it is traditional to enclose them in braces. For example, the set 
of binary digits is {0, 1) and the set of decimal digits is {0, 1, 2, 3, 4, 5, 6, 7, 8, 9). The choice of a name for these sets would be arbitrary; 
but it would be "logical" to call them B and D , respectively. The choice of a set name is much like the choice of an identifier name in program- 
ming. Some large sets can be enumerated without actually listing all the elements. For example, the letters of the alphabet and the integers from 
1 to 100 could be described as 

A = [a, b, c, X, y, z},andG = {1, 2, ... , 99, 100}. 

The three consecutive "dots" are called an ellipsis. We use them when it is clear what elements are included but not listed. An ellipsis is used in 
two other situations. To enumerate the positive integers, we would write {1, 2, 3, ...), indicating that the list goes on infinitely. If we want to 
list a more general set such as the integers between 1 and n, where n is some undetermined positive integer, we might write {1, n}. 

Method 2: Standard Symbols. Frequently used sets are usually given symbols that are reserved for them alone. For example, since we will be 
referring to the positive integers throughout this book, we wiU use the symbol IP instead of writing {1, 2, 3, ...).A few of the other sets of 
numbers that we will use frequently are: 

= the natural numbers = (0, 1, 2, 3, ...). 
Z = theintegers = {..., -3, -2, -1, 0, 1, 2, 3, ...} 
Q = the rational numbers. 
R = the real numbers. 
C = the complex numbers. 

Method 3: Set-Builder Notation. Another way of describing sets is to use set-builder notation. For example, we could define the rational 
numbers as 

Q= {a/b : a, beZ, bi=0] 
Note that in the set-buUder description for the rational numbers: 

(1) a/b indicates that a typical element of the set is a "fraction." 

(2) The colon is read "such that" or "where," and is used interchangeably with a vertical line, |. 

(3) a, e Z is an abbreviated way of saying a and b are integers. 

(4) All commas in mathematics are read as "and." 

The important fact to keep in mind in set notation, or in any mathematical notation, is that it is meant to be a help, not a hindrance. We hope 
that notation will assist us in a more complete understanding of the collection of objects under consideration and will enable us to describe it in 
a concise manner. However, brevity of notation is not the aim of sets. If you prefer to write a e Z and b instead of a, be Z, you should 
do so. Also, there are frequently many different, and equally good, ways of describing sets. For example, {jrelR| x^ -5x + 6 = 0} and 
{x I jreR: x^ - 5x + 6 = 0} both describe the solution set {2, 3}. 

A proper definition of the real numbers is beyond the scope of this text. It is sufficient to think of the real numbers as the set of points on a 
number line. The complex numbers can be defined using set-buUder notation as C = {a + bi:a, b e R), where = -1. 
In the following definition we wiU leave the word "finite" undefined. 

Definition: Finite Set. A set is a finite set if it has a finite number of elements. Any set that is not finite is an infinite set. 

Definition: Cardinality. Let A be a finite set. The number of different elements in A is called its cardinality and is denoted by | A | . 

As we win see later, there are different infinite cardinalities. We can't make this distinction now, so we wiU restrict cardinality to finite sets until 
later. 
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SUBSETS 

Definition: Subset. Let A and B be sets. We say that A is a subset ofB (notation A cB) if and only if every element of A is an element of 

B. 

Example 1.1.1. 

(a) If A = {3, 5, 8} andB = {5, 8, 3, 2, 6}, then A c B. 

(b) I^cZcQcRcC 

(c) IfA = {3, 5, 8} andB = {5, 3, 8},then A c B andB c A. 

Definition: Equality. Let A and B he sets. We say that A is equal to B (notation A = B) if and only if every element of A is an element of 
B and conversely every element ofB is an element of A; that is, A cB and B cA. 

Example 1.1.2. 

(a) In Example 1 .1 .Ic, A = B. Note that the ordering of the elements is unimportant. 

(b) The number of times that an element appears in an enumeration doesn't affect a set. For example, if A = {1, 5, 3, 5) and 6 = {1, 5, 3), 
then A = B. Warning to readers of other texts: Some books introduce the concept of a multiset, in which the number of occurrences of an 
element matters. 

A few comments are in order about the expression "if and only if" as used in our definitions. This expression means "is equivalent to saying," or 
more exactly, that the word (or concept) being defined can at any time be replaced by the defining expression. Conversely, the expression that 
defines the word (or concept) can be replaced by the word. 

Occasionally there is need to discuss the set that contains no elements, namely the empty set, which is denoted by the Norwegian letter 0. This 
set is also called the null set. 

It is clear, we hope, from the definition of a subset, that given any set A we have A c A and c A. Both and A are called improper subsets of 
A.lf B Q A, B t 0, and B t A, then B is called a proper subset of A. 

EXERCISES FOR SECTION 1.1 

A Exercises 

1. List four elements of each of the following sets: 

(a) {A: e P I ^ - 1 is a multiple of 7} 

(b) {x I xisa fruit and x ' s skin is normally eaten} 

(c) {xeQlxeZ.) 

(d) {2n\neZ, n < 0} 

(e) {s\s = I + 2 + ■■■ + n, neP] 

2. List all elements of the following sets: 

(a) {^1 we (3, 4, 5,6)) 

(b) (ae the alphabet | a precedes F) 

(c) {-k \ k eP] 

(d) (n^ I n = -2, -1, 0, 1, 2) 

(e) {n e IP | n is a factor of 24} 

3. Describe the following sets using set-builder notation. 

(a) { 5, 7, 9, ... , 77, 79} 

(b) the rational numbers that are strictly between -1 and 1 

(c) the even integers 

(d) {-18, -9,0, 9, 18, 27, . . .} 

4. Use set-builder notation to describe the following sets: 

(a) {1,2,3,4,5,6,7} 

(b) {1, 10, 100, 1,000, 10,000} 
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(c) {1,1/2,1/3,1/4,1/5,...} 

(d) {0} 

5. Let A = {0, 2, 3}, B = {2, 3}, and C = {1, 5, 9}. Detemiine which of the foUowing statements are true. Give reasons for your answers. 

a. 3 e A 

b. {3} e A 

c. {3} cA 
A. BqA 

e. AcB 

f. 0cC 

g. 0eA 

h. AcA 

C Exercise 

6. One reason that we left the definition of a set vague is Russell's Paradox. Many mathematics and logic books contain an account of this 
paradox. Two references are StoU and Quine. Find one such reference and read it. 
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1 1 .2 Basic Set Operations 

Definition: Intersection. Let A and B be sets. The intersection of A and B (denoted by A P\ B) is the set of all elements that are in both A 
and B. That is, A f] B = [x : x e Aandx e B). 

Example 1.2.1. 

(a) Let A = (1, 3, 8)andB = (-9, 22, 3). Then A fl B = |3). 

(b) Solving a system of simultaneous equations such as x + y = 1 and x - y = 3 can be viewed as an intersection. Let 
A = {(x, y) : X + y = 7 , X, y e R] and B = {(x, y) : x - y - 3, x, y e IR). These two sets are lines in the plane and their intersection, 
A f] B = {(5, 2)1, is the solution to the system. 

(c) znQ = z. 

(d) IfA = |3, 5, 9)andB = {-5, 8), then A fl B =0. 

Definition: Disjoint Sets. Two sete are disjoint if they have no elements in common (as in Example 1 .2.1 d). That is, A and B are disjoint 
ifAHB = 0. 

Definition: Union. Let A and B be sets. The union of A and B (denoted by A[J B) is the set of all elements that are in A or in B or in 
both A and B. That is, A[j B = [x : x e A orx e B). 

It is important to note in the set-builder notation for A[J B, the word "or" is used in the inclusive sense; it includes the case where x is in both A 
andB. 

Example 1.2.2. 

(a) If A = |2, 5, Sjandfi = |7, 5, 22), then A IJ B = |2, 5, 8, 7, 22|. 

(b) ZUQ=Q. 

(c) A U = ^ for any set A. 

Frequently, when doing mathematics, we need to establish a universe or set of elements under discussion. For example, the set 
A = {x : 81 x"* - 16 = 0) contains different elements depending on what kinds of numbers we allow ourselves to use in solving the equation 
81x*— 16 = 0. This set of numbers would be our universe. For example, if the universe is the integers, then A is 0. If our universe is the 
rational numbers, then A is {2/3, -2/3) and if the universe is the complex numbers, then A is |2/3, -2/3, 2i/3, — 2£/3). 

Definition: Universe. The universe, or universal set, is the set of all elements under discussion for possible membership in a set. 
We normally reserve the letter U for a universe in general discussions. 

VENN DIAGRAMS 

When working with sets, as in other branches of mathematics, it is often quite useful to be able to draw a picture or diagram of the situation 
under consideration. A diagram of a set is called a Venn diagram. The universal set U is represented by the interior of a rectangle and the sets 
by disks inside the rectangle. 

Example 1.2.3. 

(a) A {~\ B is illustrated in Figure 1 .2.1 by shading the appropriate region. 



AX""" 

























FIGURE 1.2.1 Venn diagram for intersection 



(b) A[j B is illustrated in Figure 1.2.2. 
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FIGURE 1 .2.2 Venn diagram for union 



In a Venn diagram, the region representing A f] B does not appear empty; however, in some instances it will represent the empty set. The same 
is true for any other region in a Venn diagram. 

Definition: Complement. Let A and B be sets. The complement of A relative to B (notation B - A) is the set of elements that are in B 
and not in A. That is, B - A = [x : x e B and x ^ A]. If U is the universal set, then U - A is denoted by A' and is called simply the comple- 
ment of A. A'' = [x £ U : X ^ A]. 

Example 1.2.4. 

(a) Let 5 = |1,2, 3, ... , 10) and A = (2,4,6,8, 10|.Thenfy-A = 1 1, 3, 5, 7, 9| and A - f/ = 

(b) If (/ = R, then the complement of the rational numbers is the irrational numbers. 

(c) W = and 0'' = U. 

(d) The Venn diagram of A - B is represented in Figure 1 .2.3. 



A 











FIGURE 1 .2.3 Venn diagram forA-B 
(e) The Venn diagram of A'' is represented in Figure 1 .2.4. 
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FIGURE 1 .2.4 Venn diagram for 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial- Share A like 3.0 United States License. 



18 



Chapter 1 - Set Theory 



(f) If B c A, then the Venn diagram of A - B is in Figure 1 .2.5 . 
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FIGURE 1 .2.5 Venn diagram for A - B where B is contained in A. 



(g) In the universe of integers, the set of even integers, |... , -4,-2, 0, 2, 4, ...), has the set of odd integers as its complement. 

Definition: Symmetric Difference. Let A and B be sets. The symmetric difference of A and B (denoted by A (BB) is the set of all elements 
that are in AandB but not in both. That is, A (B B = (A [j B) - (A f] B). 

Example 1.2.5. 

(a) Let A = |1, 3, 8)andB = |2, 4, 8).ThenAeB = |1, 2, 3, 4). 

(b) A © = A and A ® A = for any set A. 

(c) IR ffi Q = the irrational numbers. 

(d) The Venn diagram of A © B is represented in Figure 1 .2.6. 




'^1^ Mathematica Note 

One of the basic objects in Mathematica is a list. A list can be treated as a set. Here are a few examples. First we define A and B using the 
divisor function. This gives you all positive integers that divide evenly into a given positive integer, such as 525. 

A= Divisors [525] 

(1, 3, 5, 7, 15, 21, 25, 35, 75, 105, 175, 525| 
B = Divisors [300] 

(1, 2, 3, 4, 5, 6, 10, 12, 15, 20, 25, 30, 50, 60, 75, 100, 150, 300) 
Here are the union and intersection of the two sets: 
Union [A, B] 

(1, 2, 3, 4, 5, 6, 7, 10, 12, 15, 20, 21, 25, 30, 35, 50, 60, 75, 100, 105, 150, 175, 300, 525| 
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Intersection [A, B] 

{1, 3, 5, 15, 25, 75} 

Notice the outputs from both expressions automatically sort the elements in ascending order. The intersection is quite significant. It is the set 
of common divisors of both 525 and 300. The largest of these divisiors, 75, is the greatest common divisor of 525 and 300. 
Here is the complement of B with respect to A. 

Complement [A, B] 

{7, 21, 35, 105, 175, 525} 
There is no buUt-in Symmetric Difference function, but it can be defined: 

SymmetrlcDlf ference [X_, Y_] := Complement [Union [X, Y] , Intersection [X, Y] ] 
Now we can use the function: 

SymmetricDifference [A, B] 

{2, 4, 6, 7, 10, 12, 20, 21, 30, 35, 50, 60, 100, 105, 150, 175, 300, 525} 
The empty set is { }. 

SymmetricDifference [A, A] 



What corresponds to set-buUder notation in Mathematica is the function Select. For example, the set of all integers from 1 to 1000 whose 
digits add up to 10 could be described in set builder notation as {n e Z | 1 < n < 1000, sum of digits of n = 10}. We can produce that set 
using Select. 

Select [Range [1 , 1000], Function[n, Apply[Plus, IntegerDigits [n] ] == 10]] 

{19, 28, 37, 46, 55, 64, 73, 82, 91, 109, 118, 127, 136, 145, 154, 163, 172, 181, 190, 208, 217, 226, 
235, 244, 253, 262, 271, 280, 307, 316, 325, 334, 343, 352, 361, 370, 406, 415, 424, 433, 442, 451, 460, 
505, 514, 523, 532, 541, 550, 604, 613, 622, 631, 640, 703, 712, 721, 730, 802, 811, 820, 901, 910} 

An alternate form of the second argument to this expression, using a "pure function" is 

Select [Range [1 , 1000], Apply [Plus, IntegerDigits [B] ] == 10 &] 

{19, 28, 37, 46, 55, 64, 73, 82, 91, 109, 118, 127, 136, 145, 154, 163, 172, 181, 190, 208, 217, 226, 
235, 244, 253, 262, 271, 280, 307, 316, 325, 334, 343, 352, 361, 370, 406, 415, 424, 433, 442, 451, 460, 
505, 514, 523, 532, 541, 550, 604, 613, 622, 631, 640, 703, 712, 721, 730, 802, 811, 820, 901, 910} 

EXERCISES FOR SECTION 1.2 

A Exercises 

1. LetA = {0, 2, 3},B = {2, 3}, C = {1, 5, 9}, and let the universal set be i/ = {0, 1, 2, 9}. Determine: 

(a) A n B 

(b) A U B 

(c) S U A 

(d) A U C 

(e) A - B 

(f) B - A 
(g) 

(h) a 

(i) A n C 

(j) AeB 
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2. Let A, B, and C be as in Exercise 1, let D = {3, 2}, and let E = {2, 3, 2}. Detennine which of the following are true. Give reasons for 
your decisions. 

(a) A = B 

(b) B = C 

(c) B = D 

(d) E = D 

(e) AnB = Bf]A 

(f) AUB = BUA 

(g) A-fi = B-A 

(h) A e fi = fi e A 

3. Let 6' = {1, 2, 3, . . . , 9} . Give examples of sets A, 6, and C for which: 

(a) A n (B n c) = (A n B) n c 

(b) An(BUc)=(AnB)U(Anc) 

(c) (A U B)" = A<^ n B' 

(d) A U = i/ 

(e) A c A U B 

(f) ApB cA 

4. Let f/ = (1, 2, 3, 9). Give examples to illustrate the following facts: 

(a) If A c Band 5 cC.thenAcC. 

(b) A - B i= B - A 

(c) liU = A U B and A n B = , it always follows that A = i/ - B. 

(d) Ax (B DC) = (Ax B)ri (A xC) 

B Exercises 

5. What can you say about A if 1/ = {1, 2, 3, 4, 5},B = {2, 3}, and (separately) 

(a) A UB = {1, 2, 3,4} 

(b) A n B = {2} 

(c) A e B = {3, 4, 5} 

6. Suppose that is an infinite universal set, and A and B are infinite subsets of U. Answer the following questions with a brief explanation. 

(a) Must A'' be finite? 

(b) MustAIJB infinite? 

(c) Must A n B be infinite? 

7. Given that U = all students at a university, D = day students, M = mathematics majors, and G = graduate students. Draw Venn diagrams 
illustrating this situation and shade in the following sets: 

(a) evening students 

(b) undergraduate mathematics majors 

(c) non-math graduate students 

(d) non-math undergraduate students 

8. Let the sets D,M,C, and J/ be as in exercise 7. Let | (7 | = 16, 000 , | £) | = 9,000, \ M \ = 300, and \ G \ = 1 , 000. Also assume that 
the number of day students who are mathematics majors is 250, fifty of whom are graduate students, that there are 95 graduate mathematics 
majors, and that the total number of day graduate students is 700. Determine the number of students who are: 

(a) evening students 
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(b) nonmathematics majors 

(c) undergraduates (day or evening) 

(d) day graduate nonmathematics majors 

(e) evening graduate students 

(f) evening graduate mathematics majors 

(g) evening undergraduate nonmathematics majors 

C Exercise 

9. (a) Evaluate the following expressions in Mathematica to learn more about Select and to learn about PrlmeQ. 

? Select 
? PrlmeQ 

(b) Use Mathematica to list all primes between 2000 and 2099, inclusive. 

10. (a) Evaluate the following expression in Mathematica to learn about SquareFree 

? SquareFreeQ 

(b) Use Mathematica to list all square-free integers between 2000 and 2099, inclusive. 
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1 1 .3 Cartesian Products and Power Sets 

Definition: Cartesian Product. Let A and B he sets. The Cartesian product of A and B, denoted by AxB, is defined as follows: 
AxB = {(a, V) \ a G A andb e B], that is, AxB is the set of all possible ordered pairs whose first component comes from A and whose 
second component comes from B. 

Example 1.3.1. Notation in mathematics is often developed for good reason. In this case, a few examples wiU make clear why the symbol 
X is used for Cartesian products. 

(a) Let A = {1,2,3} and B = {4,5}.Then AxB = {(l,4),(l,5),(2,4),(2,5),(3,4),(3,5)}.Note that I AxB I = 6 = |A| x \B\. 

(b) AxA = {(1, 1), (1, 2), (1, 3), (2, 1), (2, 2), (2, 3), (3, 1), (3, 2), (3, 3)}. Note that |AxA| = 9 = |Ap. 

These two examples illustrate the general rule: If A and B are finite sets, then | A xB | =6 = |A| x |B|. 

We can define the Cartesian product of three (or more) sets similarly. For example, A x B x C = {(a, b, c) : a & A, b & B, c & C]. It is 
common to use exponents if the sets in a Cartesian product are the same: 

A^ = A X A , 
A? =A xA xA 

and in general, 

A" = ((oi , 02 , . . . , a„ ) : each a e A). 

Power Sets 

Definition: Power Set. If A is any set, the power set of A is the set of all subsets of A, denoted P{A). 
The two extreme cases, theempty set and all of A, are both included in ViA). 
Example 132. 

(a) ?'(0) = I01 

(b) nW) = (0, (11) 

(c) :P((1,2)) = (0, (1), (2), (1, 2)). 

We wiU leave it to you to guess at a general formula for the number of elements in the power set of a finite set. In Chapter 2, we wiU discuss 
counting rules that wiU help us derive this formula. 

EXERCISES FOR SECTION 1.3 

A Exercises 

1. Let A = {0, 2, 3),B = {2, 3), C = (1, 4), and let the universal set be U = {0, 1, 2, 3, 4). List the elements of 

(a) A xB 

(b) B X A 

(c) A xBxC 

(d) C/X0 

(e) Ax A' 

(f) B2 

(g) B' 

(h) BxP(B) 

2. Suppose that you are about to flip a coin and then roU a die. Let A = {HEAD, TAIL} and B = {1, 2, 3, 4, 5, 6}. 

(a) What is | A x B |? 

(b) How could you interpret the set A x B ? 

3. List all two-element sets in P({a, b, c]) 

4. List all three-element sets in ^ ({a, b, c,d]). 
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5. How many singleton (one-element) sets are there in A) if \A\ =n ? 

6. A person has four coins in his pocket: a permy, a nickel, a dime, and a quarter. How many different sums of money can he take out if he 
removes 3 coins at a time? 

7. Let A = {+, -}andB = {00, 01, 10, 11}. 

(a) List the elements of A x B 

(b) How many elements do A and (A xB)^ have? 
B Exercises 

8. Let A = {•,□, ®) andB = (□, 0, •}. 

(a) List the elements of A x B and B x A. The parentheses and comma in an ordered pair are not necessary in cases such as this where the 
elements of each set are individual symbol. 

(b) Identify the intersection of A x B and B x A. for the case above, and then guess at a general rule for the intersection of A x B and B x A. 

where A and B are any two sets. 

9. Let A and B be nonempty sets. When are A x B and B x A. equal? 
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1 1.4 Binary Representation of Positive Integers 

Recall that the set of positive integers, IP, is {1, 2, 3, . . . }. Positive integers are naturally used to count things. There are many ways to count 
and many ways to record, or represent, the results of counting. For example, if we wanted to count five hundred twenty-three apples, we might 
group the apples by tens. There would be fifty-two groups of ten with three single apples left over. The fifty-two groups of ten could be put into 
five groups of ten tens (hundreds), with two tens left over. The five hundreds, two tens, and three units is recorded as 523. This system of 
counting is called the base ten positional system, or decimal system. It is quite natural for us to do grouping by tens, hundreds, thousands, . . . , 
since it is the method that all of us use in everyday life. 

The term positional refers to the fact that each digit in the decimal representation of a number has a significance based on its position. Of course 
this means that rearranging digits will change the number being described. You may have learned of numeraton systems in which the position 
of symbols does not have any significance (e.g., the ancient Egyptian system). Most of these systems are merely curiosities to us now. 

The binary number system differs from the decimal number system in that units are grouped by twos, fours, eights, etc. That is, the group sizes 
are powers of two instead of powers of ten. For example, twenty-three can be grouped into eleven groups of two with one left over. The eleven 
twos can be grouped into five groups of four with one group of two left over. Continuing along the same lines, we find that twenty-three can be 
described as one sixteen, zero eights, one four, one two, and one one, which is abbreviated 10 1 1 ltwo> or simply 101 1 1 if the context is clear. 

The process that we used to determine the binary representation of 23 can be described in general terms to determine the binary representation 
of any positive integer n. A general description of a process such as this one is called an algorithm. Since this is the first algorithm in the book, 
we will first write it out using less formal language than usual, and then introduce some "algorithmic notation." 

Step One: Start with an empty list of bits. 
Step Two: Assign the variable k the value n. 

Step Three: While k's value is positive, continue doing the following three steps, and when k becomes zero, stop. First, divide k by 2, obtaining 
a quotient q (often denoted k div 2) and a remainder r (denoted k mod 2). Second, attach r to the left-hand side of the list of bits. Third, assign 
the variable k the value of q. 

Example lA.l. Determine the binary representation of 41 . 



k =2xq +r — > 


add r to the list 


41 = 2 X 20 -1- 1 


List = 1 


20 = 2 X 10 -1- 


List = 01 


10 = 2x5 -1- 


List = 001 


5 = 2x2-1-1 


List =1001 


2 =2x 1 -1- 


List = 01001 


1 = 2 xO -F 1 


List = 101001 


Therefore, 41 = 101001,^0 



The notation that we will use to describe this algorithm and all others is called pseudocode, an informal variation of the instructions that are 
commonly used in many computer languages. Read the following description carefully, comparing it with the informal description above. 
Appendix B, which contains a general discussion of the components of the algorithms in this book, should clear up any lingering questions. 
Anything after // are comments. 

Algorithm 1.4.1: Algorithm for Determining the Binary Representation of a Positive Integer. 

Input: a positive integer n. 

Output: the binary representation of n in the form of a list of bits, with units bit last, twos bit next to last, etc. 

(1) k:=n //initialize k 

(2) L := an empty list //initialize L 

(3) While k > do 

(3.1) q := k div 2 //divide k by 2 
r:= k mod 2 

(3.2) L: = prepend r to L //Add r to the front of L 

(3.3) k:= q //reassign k 
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Now that you've read this section, you should get this joke from the xkcd.com. 



ON A SCALE OF lib 10, 

NOW LiKav IS ir TPwr 
THIS Ga;esnoNi is 

USING BlNART? 




EXERCISES FOR SECTION 1.4 
A Exercises 

1. Find the binary representation of each of the following positive integers: (a) 31 (b) 32 (c) 10 (d) 100 

2. Find the binary representation of each of the following positive integers: (a) 64 (b) 67 (c) 28 (d) 256 

3. What positive integers have the following binary representations? (a) 10010 (b) 1001 1 (c) 101010 (d) 1001 1 1 10000 

4. What positive integers have the following binary representations? (a) 100001 (b) 1001001 (c) 1000000000(d) 1001110000 

5. The example, decimal 1000 has 10 bits since it is 1111 lOlOOOnvo You might save some time by thinking of how 10 can be arrived at 
without finding the exact binary representation. How many bits do the binary representations of the following decimal numbers have? 

(a) 201 1 (b) 4000 (c) 4500 (d) 2'" 

B Exercises 

6. Let m be a positive integer with n-bit binary representation: a„_i a„_2 ••• oi Qq with a„_i = 1 What are the smallest and largest values that m 
could have? 

7. If a positive integer is a multiple of 100, we can identify this fact from its decimal representation, since it will end with two zeros. What can 
you say about a positive integer if its binary representation ends with two zeros? 

8. Can a multiple of ten be easily identified from its binary representation? 



® 
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1 1.5 Summation Notation and Generalizations 

Most operations such as addition of numbers are introduced as binary operations. That is, we are taught that two numbers may be added 
together to give us a single number. Before long, we run into situations where more than two numbers are to be added. For example, if four 
numbers, ai, 02, 03, and 04 are to be added, their sum may be written down in several ways, such as ((ai + 02) + 03) + 04 or 
(ai + 02) + (03 + 04). In the first expression, the first two numbers are added, the result is added to the third number, and that result is added to 
the fourth number. In the second expression the first two numbers and the last two numbers are added and the results of these additions are 
added. Of course, we know that the final results will be the same. This is due to the fact that addition of numbers is an associative operation. For 
such operations, there is no need to describe how more than two objects will be operated on. 

4 

A sum of numbers such as ai + ^2 + 03 + 04 is called a series and is often written X oj. in what is called summation notation. 

We first recall some basic facts about series that you probably have seen before. A more formal treatment of sequences and series is covered in 
Chapter 8. The purpose here is to give the reader a working knowledge of summation notation and to carry this notation through to intersection 
and union of sets and other mathematical operations. 

n 

A finite series is an expression such as ai + 02 + 03 + ... +a„ = 2 O/t 

k=l 

In 2 a„ / is referred to as the index, or the index of summation; the value below the summation symbol is the initial index and the value above 
the summation symbol is the terminal index. The a, are called the terms of the series. The initial index of a summation may be different from 1 . 

Example 1.5.1. 

4 

(a) 2 a,- = ai + 02 + 03+04 
(=1 

5 

(b) ^bi, = bo + bi+b2 + h+b4 + 

2 

(c) 2; C; = C_2 + C_i + Co + Ci + C2 

i=-2 

Example 1.5.2. If the terms in series are more specific, the sum can often be simplified. For example, 

(a) 2/2^12 + 22 + 32 + 42 = 30 

i=l 
5 

(b) 2; (2 - 1) = (2 X 1 - 1) + (2 X 2 - 1) + (2 X 3 - 1) + (2 X 4 - 1) + (2 X 5 - 1) 

i=l 

=1+3+5+7+9 
= 25 

Summation notation can be generalized to many mathematical operations, for example, 

AinA2nA3nA4= riA; 

Definition: Generalized Set Operations. Let A], A2, A„ be sets, then: 

(1) A, nA2n •••nA„ = nA,■ 
i=^ 

(1) A, UA2U •••UA„ = UAi 

i=l 

n 

(1) AiXA2X---xA„ = X Ai 

i=l 

(1) Aj 9A2 e---®An = SAi 

i=l 

Example 1.53. If Ai = {0, 2, 3},A2 = {1, 2, 3, 6},andA3 = {-1, 0, 3, 9}, then 

nAi=AinA2nA3 = {3) 

!=1 

and 
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UA=AiUA2UA3 = {-l,0, 1,2,3,6,9) 

;=1 

With this notation it is quite easy to write lengthy expressions in a fairly compact form. For example, the statement 

A n (fii u fi2 u - u B«) = (A n Bi) u (A n B2) u ••• u (A n b«) 

becomes 

Af]\UBi\= U(AnBn) 



(n \ n 
UBi = U 
i=l / i=l 



EXERCISES FOR SECTION 1.5 
A Exercises 

1 . Calculate the foUowing series: 
(a) 2(2 + 3!) 



1 

2 



(b) Zr 

i=-2 



(c) E V for n= 1, 2, 3, 4 

7=0 

(d) Z(2^- 1) for n = 1, 2, 3, 4 
2. Calculate the following series: 



(a) 


Z i" f ' 










(b) 


Z20 




i=l 






(c) 


Z(n- 




J=0 



(d) Z ^ for n = 1, 2, 3, 4 



3. (a) Express the formula Z = — without using summation notation. 

(=1 

(b) Verify this formula forn = 3. 

(c) Repeat parts (a) and (b) for Z = " 

4. Verify the following properties forn = 3. 

(a) ^ioi+bi) = Za, + Z bi 

1=1 ;=1 1=1 



(11 ^ n 
Zfli = Zca; 
i=l / i=l 

5. Rewrite the following without summation sign forn = 3. It is not necessary that you imderstand or expand the notation ^ ^ j at this point. 



(x + y)»= Zf ? ]^""*/ 



k=0 

4 

6. (a) Draw the Verm diagram for f} Ai. 

i=l 
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(b) Express in "expanded format": 



7. For any positive integer A:, let = {j; e Q : k - I <x<k} and Bj^ = {j; e Q : -fc < a: < fc}. What are the following sets? 

(a) UA,- 
(=1 



(b) US, 

(c) n^,- 
(=1 

(d) riBi 



8. For any positive integer k, let A = {xeQ: 0<x< 1 / k] and Bi^ = {xeQ: 0<x< /c). What are the following sets? 

00 

(a) UA,- 
(=1 



(b) U5, 

(=1 

oo 

(c) PA,- 
(=1 

00 

(d) 

9. The symbol n is used for the product of numbers in the same way that £ is used for sums. For example, 

5 

YlXi = XiX2X3X4Xi 
(=1 

Evaluate the following: 
(a) 

(b) n(2/+l) 
(=1 

10. Evaluate 

(a) n2* 

k=0 
100 , 
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SUPPLEMENTARY EXERCISES FOR CHAPTER 1 
Section 1 .1 

1 . Enumerate the elements in the following sets: 

(a) {;i:eR|;i:^ - 3;c + 2 = 0} 

(b) {x&R\x^ + 1 = 0} 

(c) {;ceC|;c2 + 1 = 0} 

Section 1 .2 

2. Lett/ = (0,1,2,3 9), A = {0, 2, 3),B = {2, 3), C = (1, 5, 9). 

Determine: 

(a) (A U B) n C 

(b) A<^ n B'^ 

(c) (A [J BY 

3. Leti/={0, l,2,3,4,5,6,7,8,9},A = {;cei/:xisamultipleof3},andB = {;cei/:;c2 - 5 > 0}. 
Determine: 

(a) AUB 

(b) AnB 

(c) 

4. Let A, 6, and C be subsets of some universal set U. Draw Venn diagrams to illustrate each of the following expressions: 

(a) (A n Bf (e) A'' R B'' fl C 

(b) A- U (f) A'- n B'' U C'' 

(c) A n (B u c) (g) A'' n (B u cr 

(d) (A n B) u (A n o (h) (A<^ n bo u (a-^ n n 

5. Let A = {1,2,3,4,5,6}, B = {2,5,7,9, 12}, and C= {4,7,9,15,23}. 

(a) Verify the formula: #(A U B) = #A + #B - #(A fl B) for this 
example. 

(b) Use the formula in part (a) to compute: 

#(AUC),#(BUC),#(AnB) 
(c) Use the formula of part (a) to derive a formula for #(A IJ B U Q- 
Verify your formula for this example. 

6. (7= {1,2, 3, 4, 5, 6, 7,8}, A = {a e i/ | is even}, andB = {a e C/ I a + 1 is a multiple of three} 

(a) A = { } (List) 

(b) B = { } (List) 

(c) A<^ = 

(d) AnB= 

Section 1 .3 

7. Let i/ = {0, 1, 2, 3, . . . , 9}, A = {2, 4, 6}, B = {4}, C = {1, 5}. 
Determine: 

(a) B^ (d)BxA 

(b) A X B (e) (A X B) X C 
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(C) B3 

8. Let A, B, and C be sets where |A| = 2, |B| = 3,and |C| =4. 

(a) Give an example showing that AxBxC i= AxC xB, and explain why the two sets are not equal. 

(b) Is \AxBxC\ = \AxC xB\ 

9. Let A = {1,2}. Determine | 'P (A) x 'P (A) | and then list all elements of P(A) x P(A). 

10. Let A = {a, b, c, d}. Determine which of the following are true: 

(a) {b} e P{A) (d) b e PiA) 

(b) AeP(A) (e)0eP(A) 

(c) P(A) e P(A) (f) P(A) c P(A) 

Section 1 .4 

1 1 . The addition rules for binary numbers are: 

+ = 0, 

1+0 = 0+1 = 1, 

1 + 1=0 and carry 1 to the next column 
and 1 + 1 + 1 = 1 and carry 1 to the next column. 
For example: 11111 < — carries 

31te„= 11111 
ll,e„ = 1011 

101010 = 42te„ 

Express and compute the following sums with binary numbers; verify your result by converting back to base ten. 
(a) 31 +32 (b)64 + ll (c) 13 + 15 + 9 

12. Compute the following sums and express the result in both binary and decimal form. 

(a) 10010 + 10011 (c) 1101.11 + 100.1 

(b) 101001 + 1101 

13. Multiphcation of binary numbers is a process similar to decimal multiphcation. The rules for binary multiplication are: 0=0, 
l- = 0-1 =0, 1-1 = 1. Compute the following products and verify by converting numbers to base ten: 

(a) 1001 11 (b) 1001 ■ 1101 

14. How are multiphcation and division by two accompUshed in the binary number system? 

Section 1 .5 

15. Let A„ = {k <=^ : k < 3 «} for each n e f^. List the elements, 
(a) A„forn = 0, 1,2,3 



3 



(b) 



r\A, 



n=l 



3 



r\A, 



n=0 



3 



(d) 



UA, 



In 



n=l 



(e) {JA„ 



,1=0 

16. Expand each of the following and convince yourself that they are true. Assume that m, n e M with m < n, and c, x„ e 
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n n—\ 

(a) = + x„ 

(=1 i=l 

n n n 

(b) Y^Xi + yd = Y/i + 

(=1 i=l i=l 

rt n 

(c) ^cj:; = c^j:; 

1=1 i=l 

(d) = Yj^i + X ^' 

(=1 i=l i=m+l 

17. Which of the above are true if + is replaced with ■ and E is replaced with 0? Explain. 
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chapter 2 




COMBINATORICS 

GOALS 

Throughout this book we will be counting things. In this chapter we will outline some of the tools that will help us count. 

Counting occurs not only in highly sophisticated applications of mathematics to engineering and computer science but also in many basic 
applications. Like many other powerful and useful tools in mathematics, the concepts are simple; we only have to recognize when and how they 
can be applied. 

I 2.1 Basic Counting Techniques— The Rule of Products 

WHAT IS COMBINATORICS? 

One of the first concepts our parents taught us was the "art of counting." We were taught to raise three fingers to indicate that we were three 
years old. The question of "how many" is a natural and frequently asked question. Combinatorics is the "art of counting." It is the study of 
techniques that will help us to count the number of objects in a set quickly. Highly sophisticated results can be obtained with this simple 
concept. The following examples will illustrate that many questions concerned with counting involve the same process. 

Example 2.1.1. A snack bar serves five different sandwiches and three different beverages. How many different lunches can a person 
order? One way of determining the number of possible lunches is by listing or enumerating all the possibilities. One systematic way of doing 
this is by means of a tree, as in Figure 2.1.1. 
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FIGURE 2.1.1 Tree solution for Example 2.1.1 

Every path that begins at the position labeled START and goes to the right can be interpreted as a choice of one of the five sandwiches 
followed by a choice of one of the three beverages. Note that considerable work is required to arrive at the number fifteen this way; but we 
also get more than just a number. The result is a complete list of all possible lunches. If we need to answer a question that starts with "How 
many . . . ," enumeration would be done only as a last resort. In a later chapter we will examine more enumeration techniques. 

An alternative method of solution for this example is to make the simple observation that there are five different choices for sandwiches and 
three different choices for beverages, so there are 5 • 3 = 15 different lunches that can be ordered. 

A listing of possible lunches a person could have is: {(BEEF, milk), (BEEF, juice), (BEEF, coffee), (BOLOGNA, coffee)). 

Example 2.1.2. Let A = [a, b, c, d, e) and B = {1, 2, 3}. From Chapter 1 we know how to list the elements in 
A X B = ((a, 1), (a, 2), (a, 3), (e, 3)). The reader is encouraged to compare Figure 2.1.1 for this example. Since the first entry of 
each pair can be any one of the five elements a, b, c, d, and e, and since the second can be any one of the three numbers 1 , 2, and 3, it is quite 
clear there are 5 • 3 = 1 5 different elements in A x B . 

Example 2.1.3. A person is to complete a true-false questionnaire consisting often questions. How many different ways are there to 
answer the questionnaire? Since each question can be answered either of two ways (true or false), and there are a total of ten questions, there 
are 2-2-2-2-2-2-2-2-2-2 = 2'" = 1024 different ways of answering the questionnaire. The reader is encouraged to visualize the tree 
diagram of this example, but not to draw it! 

We formalize the procedures developed in the previous examples with the following rule and its extension. 
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THE RULE OF PRODUCTS 

Rule Of Products: If two operations must be performed, and If the first operation can always be performed pi different ways and the second 

operation can always be performed p2 different ways, then there are p\- p2 different ways that the two operations can be performed. 

Note: It is important that p2 does not depend on the option that is chosen in the first operation. Another way of saying this is that p2 is indepen- 
dent of the first operation. If p2 is dependent on the first operation, then the rule of products does not apply. 

Example 2.14. Assimie in Example 2.1.1 that coffee is not served with a beef or chicken sandwich, then by inspection of Figure 2.1.1 
we see that there are only thirteen different choices for lunch. The rule of products does not apply, since the choice of beverage depends on 

one's choice of a sandwich. 

Extended Rule Of Products. The rule of products can be extended to include sequences of more than two operations. If n operations must be 
performed, and the number of options for each operation is pi, p2, ■■■ , and respectively, with each pi independent of previous choices, then 
the n operations can be performed 

n 

Pl-P2 Pn = n Pi 

i=l 

Example 2.1.5. A questioimaire contains four questions that have two possible answers and three questions with five possible answers. 
Since the answer to each question is independent of the answers to the other questions, the extended rule of products applies and there are 
2-2-2-2-5-5-5 = 2*5^ = 2000 different ways to answer the questioimaire. In Chapter 1 we introduced the power set of a set A, P(A), 
which is the set of all subsets of A. Can we predict how many elements are in P(A) for a given finite set A? The answer is yes, and in fact 
I P(_A) I =2". The ease with which we can prove this fact demonstrates the power and usefulness of the rule of products. Do not underesti- 
mate the usefulness of simple ideas. 

Theorem 2.1.1. If A is a finite set, then | P^A) \ =2^ . 

Proof: Let B <= PiA) and assume \A \ = n. Then for each element xeA there are two choices, either x e B of x ^ B. Since there are n 
elements of A we have , by the rule of products , 2-2 2 = 2" different subsets of A. Therefore, | P(A) | = 2" ■ 

n factore 

EXERCISES FOR SECTION 2.1 
A Exercises 

1. In horse racing, to bet the "daily double" is to select the winners of the first two races of the day. You win only if both selections are correct. 
In terms of the number of horses that are entered in the first two races, how many different daily double bets could be made? 

2. Professor Shortcut records his grades using only his students' first and last initials. What is the smallest class size that wiU definitely force 
Prof. S. to use a different system? 

3. A certain shirt comes in four sizes and six colors. One also has the choice of a dragon, an alligator, or no emblem on the pocket. How many 

different shirts could you order? 

4. A builder of modular homes would like to impress his potential customers with the variety of styles of his houses. For each house there are 
blueprints for three different living rooms, four different bedroom configurations, and two different garage styles. In addition, the outside can 

be finished in cedar shingles or brick. How many different houses can be designed from these plans? 

5. The Pi Mu Epsilon mathematics honorary society of Outstanding University wishes to have a picture taken of its six officers. There will be 
two rows of three people. How many different way can the six officers be arranged? 

6. An automobile dealer has several options available for each of three different packages of a particular model car: a choice of two styles of 
seats in three different colors, a choice of four different radios, and five different exteriors. How many choices of automobile does a customer 
have? 

7. A clothing manufacturer has put out a mix-and-match collection consisting of two blouses, two pairs of pants, a skirt, and a blazer. How 
many outfits can you make? Did you consider that the blazer is optional? How many outfits can you make if the manufacturer adds a sweater to 

the collection? 

8. As a freshman, suppose you had to take two of four lab science courses, one of two literature courses, two of three math courses, and one of 
seven physical education courses. Disregarding possible time conflicts, how many different schedules do you have to choose from? 

9. (a) Suppose each single character stored in a computer uses eight bits. Then each character is represented by a different sequence of eight O's 
and I's called a bit pattern. How many different bit patterns are there? (That is, how many different characters could be represented?) 

(b) How many bit patterns are palindromes (the same backwards as forwards)? 

(c) How many different bit patterns have an even number of I's? 

10. Automobile license plates in Massachusetts usually consist of three digits followed by three letters. The first digit is never zero. How many 
different plates of this type could be made? 

11. (a) Let A = {a, b, c, d}. Determine the number of different subsets of A. 
(b)LetA = {1, 2, 3, 4, 5}. Determine the number ofproper subsets of A. 

12. How many integers from 100 to 999 can be written with no 7's? 
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13. Consider three persons, A, B, and C, who are to be seated in a row of three chairs. Suppose A and B are identical twins. How many seating 
arrangements of these persons can there be: 

(a) If you are a total stranger? 

(b) If you are A and B's mother? 

(This problem is designed to show you that different people can have different correct answers to the same problem.) 

14. How many ways can a student do a ten-question true-false exam if he or she can choose not to answer any number of questions? 

15. Suppose you have a choice of fish, lamb, or beef for a main course, a choice of peas or carrots for a vegetable, and a choice of pie, cake, or 
ice cream for dessert. If you must order one item from each category, how many different dinners are possible? 

16. Suppose you have a choice of vanilla, chocolate, or strawberry for ice cream, a choice of peanuts or walnuts for chopped nuts, and a choice 
of hot fudge or marshmaUow for topping. If you must order one item from each category, how many different sundaes are possible? 

B Exercises 

17. A questionnaire contains six questions each having yes-no answers. For each yes response, there is a follow-up question with four possible 

responses. 

(a) Draw a tree diagram that illustrates how many ways a single question in the questionnaire can be answered. 

(b) How many ways can the questionnaire be answered? 

18. Ten people are invited to a dinner party. How many ways are there of seating them at a round table? If the ten people consist of five men 
and five women, how many ways are there of seating them if each man must be surrounded by two women around the table? 

19. How many ways can you separate a set with n elements into two nonempty subsets if the order of the subsets is immaterial? What if the 
order of the subsets is important? 

20. A gardener has three flowering shrubs and four nonflowering shrubs. He must plant these shrubs in a row using an alternating pattern, that 
is, a shrub must be of a different type from that on either side. How many ways can he plant these shrubs? If he has to plant these shrubs in a 
circle using the same pattern, how many ways can he plant this circle? Note that one nonflowering shrub will be left out at the end. 
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I 2.2 Permutations 

A number of applications of the rule of products are of a specific type, and because of their frequent appearance they are given a designation all 
their own— permutations. Consider the following examples. 

Example 2.2.1. How many different ways can we order the three different elements of the set A = {a, b, c}? Since we have three choices 
for position one, two choices for position two, and one choice for the third position, we have, by the rule of products, 3-2-1 = 6 different ways 
of ordering the three letters. We illustrate through a tree diagram: 



C ¥ ^ u |5» Film** trti 




Figure 2.2.1 

Each of the six orderings is called a permutation of the set A. 

Example 2.2.2. A student is taking five courses in the fall semester. How many different ways can the five courses be listed? There are 
5x4x3x2x1= 120 different permutations of the set of courses. 

In each of the above examples of the rule of products we observe that: 

(1) We are asked to order or arrange elements from a single set. 

(2) Each element is listed exactly once in each list (permutation). So if there are n choices for position one in a list, there are n - 1 choices for 
position two, n - 2 choices for position three, etc. 

Example 2.2.3. The alphabetical ordering of the players of a baseball team is one permutation of the set of players. Other orderings of 
the players' names might be done by batting average, age, or height. The information that determines the ordering is called the key. We would 
expect that each key would give a different permutation of the names. If there are twenty-five players on the team, there are 
25 • 24 • 23 2-1 different permutations of the players. 

We now develop notation that will be useful for permutation problems. 

Definition: Factorials. If n is a positive integer then n factorial is the product of the first n positive integers and is denoted ni. Addition- 
ally, we define zero factorial to be 1 . 

0! = 1 

1! = 1 

2! =2-1=2 

3! = 3-2-1 =6 

n\ = n-{n- \)-{n-2) 2 • 1 = {{k 

k=\ 

Note that 4! is 4 times 3!, or 24, and 5 ! is 5 times 4!, or 120. In addition, note that as n grows in size, n \ grows extremely quickly. For 
example , 11! = 39916 800 . If the answer to a problem happens to be 25 ! , for example , you would never be expected to write that number out 
completely. However, 

25' 

a problem with an answer of — can be reduced to 25 • 24, or 600. 

So if I A I = «, there are n \ ways of permuting all n elements of A. We next consider the more general situation where we would like to 
permute k elements of a set of n objects, k<n. 

Example 2.2.4. A club of twenty-five members will hold an election for president, secretary, and treasurer in that order. Assume a 
person can hold only one position. How many ways are there of choosing these three officers? By the rule of products there are 25 • 24 • 23 
ways of making a selection. 

Definition: Permutation. An ordered arrangement ofk elements selected from a set of n elements, < k < n, where no two elements of 
the arrangement are the same, is called a permutation of n objects taken k at a time. The total number of such permutations is denoted by 
Pin: k). 

Theorem 22.1. The number of possible permutations of k elements taken from a set of n elements is 
P(n: k) = n-(n-l)-(n-2) {n-k+l)= = z'(n-i) 

Proof: Case I: If /: = n we have P (n; n) = = n] , which is simply the rule of products as applied in Example 2.2.3. 
Case II: If < ^ < « then, as in Example 2.2.4, we have k positions to fill with n elements and 
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position 1 can be filled by any one of n elements 
position 2 can be filled by any one of n - 1 elements 
position 3 can be filled by any one of n - 2 elements 

position k can be filled by any one oin - k + \ elements. 
Hence, by the rule of products, = n -{n - - 2) {n - k + 1). Also, 

n! _ n (n- 1 ) («-2) 1 ) (n-t) 21 

(ji-k)\ ~ {n-k)\ 

= n -(n - l)-(n - 2) (n - k + 1) ■ 

It is important to note that the derivation of the permutation formula given in Theorem 2.2.1 was done solely through the rule of products. This 
serves to reiterate our introductory remarks in this section that permutation problems are really rule-of-products problems. Every permutation 
problem can be done by the rule of products. We close this section with several examples solved by both methods. 

Example 2.2.5. A club has eight members eligible to serve as president, vice-president, and treasurer. How many ways are there of 
choosing these officers? 

Solution 1: Using the rule of products. There are eight possible choices for the presidency, seven for the vice-presidency, and six for the 

office of treasurer. By the rule of products there are 8 x7 x6 = 336 ways of choosing these officers. 

Solution 2: Using the permutation formula. We want the total number of permutations of eight objects taken tiiree at a time: 

P(8, 3) = — = 8x7x6x5! ^g.-^.g ^33(5 

(8-3)! 5! 

Example 2.2.6. Example 2.2.2 revisited. Solution 2: Using the permutation formula. We want the total number of permutations of five 
courses taken five at a time: 

^(5; 5) = ^ = |=120 
Example 2.2.7. Consider the digits 1, 2, 3, 4, and 5. 

(a) How many three-digit numbers can be formed if no repetition of digits can occur? 

(b) How many three-digit numbers can be formed if repetition of digits is allowed? 

Solution to (a): 

Solution 1: Using the rule of products. We have any one of five choices for digit one, any one of four choices for digit two, and three choices 
for digit three. Hence, 5 -4 • 3 = 60 different three-digit numbers can be formed. 

Solution 2; Using the permutation formula. We want the total number of permutations of five digits taken three at a time: 
^(5^3)=^ = 1= ^x4x3x2x1 ^5^4^3 ^ 60 

Solution to (b): The definition of permutation indicates ". . .no two elements in each list are the same." Hence the permutation formula cannot 
be used. However, the rule of products still applies. We have any one of five choices for the first digit, five choices for the second, and five 
for the third. So there are 5-5-5 = 125 possible different three-digit numbers if repetition is allowed. 

EXERCISES FOR SECTION 2.2 
A Exercises 

1 . If a raffle has three different prizes and there are 1 ,000 raffle tickets sold, how many different ways can the prizes be distributed? 

2. (a) How many three-digit numbers can be formed from the digits 1 , 2, 3 if no repetition of digits is allowed? List the three-digit numbers. 

(b) How many two-digit numbers can be formed if no repetition of digits is allowed? List them. 

(c) How many two-digit numbers can be obtained if repetition is allowed? 

3. How many eight-letter words can be formed from the 26 letters in the alphabet? Even without concerning ourselves about whether the words 
make sense, there are two interpretations of this problem. Answer both. 

4. Let A be a set with \ A \ =n . 

(a) Determine | A' | . 

(b) Determine | {{a, h, c)eA^ | each coordinate is different) |. 

5. The state finals of a high school track meet involves fifteen schools. How many ways can these schools be listed in the program? 
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6. Consider the three-digit numbers that can be formed from the digits 1,2,3,4,5 with no repetition of digits allowed. 

(a) How many of these are even numbers? 

(b) How many are greater than 250? 

7. (a) How many ways can the coach at Tall U. fill the five starting positions on a basketball team if each of his 15 players can play any 
position? (b) What is the answer if the center must be one of two people? 

8. (a) How many ways can a gardener plant five different species of shrubs in a circle? 

(b) What is the answer if two of the shrubs are the same? 

(c) What is the answer if all the shrubs are identical? 

9. The president of the Math and Computer Club would like to arrange a meeting with six attendees, the president included. There wiU be three 
computer science majors and three math majors at the meeting. How many ways can the six people be seated at a circular table if the president 
does not want people with the same majors to sit next to one other? 

B Exercises 

10. Six people apply for three identical jobs and all are qualified for the positions. Two wiU work in New York and the other one wiU work in 

San Diego. How many ways can the positions be filled? 

11. (a)LetA = |1, 2, 3, 4). Determine the cardinaUty of {(ai, 02) &A^\ai+a2} 

(b) What is the answer to part (a) if | A | = n ? 

(c) Assume that A is a set with cardinality n. Determine the number of m-tuples in A m where each coordinate is different from the other 
coordinates. Break your answer down into cases m > n and m < n. 
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I 2.3 Partitions of Sets and the Laws of Addition 

One way of counting the number of students in your class would be to count the number in each row and to add these totals. Of course this 
problem is simple because there are no duplications, no person is sitting in two different rows. The basic counting technique that you used 
involves an extremely important first step, namely that of partitioning a set. The concept of a partition must be clearly understood before we 

proceed further. 

Definition: Partition. A partition oj set A is a set of one or more nonempty subsets of A: Aj, A2, ... such that: 

(a) A;IJ^2U ■■■ = Aand 

(b) the subsets are mutually disjoint: that is, Ai f] Aj = 0for i ^ j. 

The subsets in a partition are often referred to as blocks. Note how our definition allows us to partition infinite sets, and to partition a set into an 
infinite number of subsets. Of course, if A is finite the number of subsets can be no larger than | A |. 

Example 23.1. Let A = {a, b, c, d). Three partitions of A are: 

1. {{a}, {b}, {c, d]] 

2. {{a, b), {c, d}} 

3. {{a}, {b}, {c}, {d}} 

Example 2.3.2. Two examples of partitions of Z are {{n} | n e Z) and {{n | n 6 Z, n < 0), |0), [n \ n e E, n > 0)). The set of subsets 
{{n e Z I n > 0), [n e Z | n < 0)) is not a partition because the two subsets have a nonempty intersection. A second example of a non- 
partition is {{n e Z : |n| =k} \ k = -1, 0, 1, 2, ...}. One of the blocks, {neZ : |n| =-l}isempty. 

One could also think of the concept of partitioning a set as a "packaging problem." How can one "package" a carton of, say, twenty-four cans? 
We could use: four six-packs, three eight-packs, two twelve-packs, etc. In all cases: (a) the sum of all cans in all packs must be twenty-four, and 
(b) a can must be in one and only one pack. 

Basic Law Of Addition: If A is a finite set, and if[Aj, Aj, . . ., A„} is a partition of A, then 

|a| = I Ai I +1 A2 I +••• +1 A„ I = i \A^\ 

The basic law of addition can be rephrased as follows: If A is a finite set where A = Ai U ^^2 U •" U '^n and where A, p A^- = whenever 
+ y, then 

|A| = |A, UA2U---UA„| = \ Ai\ + \A2\+--- + \A„\ 

Example 2.3.3. The number of students in a class could be determined by adding the numbers of students who are freshmen, sopho- 
mores, juniors, and seniors, and those who belong to none of these categories. However, you probably couldn't add the students by major, 
since some students may have double majors. 

Example 23.4. The sophomore computer science majors were told they must take one and only one of the following courses. Cryptogra- 
phy, Data Structures, or Java script, in a given semester. The numbers in each course, respectively, for sophomore CS majors, were 75, 60, 
55. How many sophomore C.S. majors are there? The Law of Addition applies here. There are exactly 75 + 60 + 55 = 190 CS majors since 
the rosters of the three courses listed above would be a partition of the CS majors 

Example 23.5. It was determined that all sophomore computer science majors took at least one of the following courses: Cryptography, 
Data Structures, or Java script. Assume the number in each course was as in Example 2.3.4. Further investigation indicated ten of them took 
all three courses, twenty-five took Calculus and Data Structures, twelve took Calculus and Compiler Construction, and fifteen took Data 

Structures and Compiler Construction. How many sophomore C.S. majors are there? 

Example 2.3.4 is a simple application of the law of addition; in Example 2.3.5, however, some students are taking two or more courses, so a 
simple application of the law of addition would lead to double or triple counting. We rephrase Example 2.3.5 in the language of sets to 
describe the situation more explicitly if we let: 

A = the set of all sophomore computer science majors 
Ai = the set of all sophomore CS majors who took Cryptography 
A2 = the set of all sophomore CS majors who took Data Structures 
A3 = the set of all sophomore CS majors who took javascript. 

Since all sophomore CS majors must take at least one of the courses, the number we want is: 

|A| = I A, UA2 U A3 I 

= + + - duplicates 

A Venn diagram is helpful to visualize the problem. In this case the universal set U can stand for any reasonable set, for example, the set of 
all students in the university. 
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Figure 2.3.1 

We see that the whole universal set is naturally partitioned into subsets that are labeled by the numbers 1 through 8, and the set A is partitioned 
into subsets labeled 1 through 7. Note also that students in the subsets labeled 2,3, and 4 are double counted, and those in the subset labeled 1 
are triple counted, in the sense that they have already been subtracted from the total twice. So 

|A| = I Ai UA2 U A3 I 

= I Ai I + I A2 I + I A3 I -duplicates 

= I Ai I + I A2 I + I A3 I -(duplicates - triplicates) 

= I A, I + IA2 I + I A3 I -IAiHAjI -|A, nA3|-|A2nA3| + |AinA2nA3| 
= 75 + 60 + 55 - 25 - 12 - 15 + 10 = 148 

Note that an alternate approach to this example could be to let A = U; in this case the subset labeled 8, namely (A\ U A2 U A3 )'', is the set of 
sophomore C. S. students who took none of the three courses, which is 0. The concepts discussed in this latest basic counting technique give 
rise to the following two formulas: 

Laws of Addition (Inclusion-Exclusion Laws) 

1 . If Ai and A2 are finite sets, then \ A\ U A2 | = \ A\ | + | A 2 | - | Ai H A2 | 

2. IfAi,A2, and A3 are finite sets, then 

I Ai UA2 U A3 I = I A, I + I A2 I + I A3 I 

-|AinA2i -|AinA3i-|A2nA3i 
+ 1 A, n A2 n A3 1 

In this section we saw that being able to partition a set into disjoint subsets gives rise to a handy counting technique. Given a set, there are many 
ways to partition depending on what one would wish to accomplish. One natural partitioning of sets is apparent when one draws a Venn 
diagram. This particular partitioning of a set will be discussed further in Chapters 4 and 13. 

EXERCISES FOR SECTION 2.3 
A Exercises 

1 . Find all partitions of the set A = {a, b, c}. 

2. Which of the following collections of subsets of the plane, IR x R, are partitions? 

(a) [[(X, J) I X + J = c) I c e R) 

(b) The set of all circles in IR x R. 

(c) The set of all circles in R x R centered at (0, 0), together with |(0, 0)). 

(d) |j(x, y)] I (X, J) eRxR) 

3. A student, on an exam paper, defined the term partition the following way: "Let A be a set. A partition of A is any set of nonempty subsets 
Ai , A2, . . . , A„ of A such that each element of A is in one of the subsets A,." Is this definition correct? Why? 

4. Let Ai and A2 be subsets of a set A. Draw a Venn diagram of this situation and shade in the subsets: Aj H At, A] H A2, A\ H A2, and 
A| n A2. Use the resulting diagram and the definition of partition to convince yourself that subset of these four subsets that are nonempty form 
a partition of A. 

5. Show that {|2« | « e Z), [2n + 1 | « e Z|| is a partition of Z. Describe this partition using only words. 

6. (a) A group of 30 students were surveyed and it was found that 18 of them took Calculus and 12 took C++. If all students took at least one 
course, how many took both Calculus and C++? Illustrate using a Venn diagram. 

(b) What is the answer to the question in part (a) if five students did not take either of the two courses? Illustrate using a Venn diagram. 
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7. A survey of 90 people, 47 of them played tennis and 42 of them swam. If 17 of the them participated in both activities, how many of them 
participated in neither. 

8. A survey of 300 people indicated: 

60 owned an iPhone 75 owned an Blackberry, and 30 owned an Android. Furthermore, 40 owned both an iPhone and Blackberry, 12 owned 
both an iPhone and Android, and 8 owned a Blackberry and an Android. Finally, 3 owned all three phones 

(a) How many people surveyed owned none of the three phones? 

(b) How many people owned an Blackberry but not an iPhone? 

(c) How many owned a Blackberry but not an Android? 

B Exercises 

9. (a) Use Inclusion-exclusion Law 1 to derive Law 2. Note, a knowledge of basic set laws is needed for this exercise, (b) State and derive the 

Inclusion-exclusion law for four sets. 

10. To complete your spring schedule, you must add Calculus and Physics. At 9:30, there are three Calculus sections and two Physics sections; 
while at 11:30, there are two Calculus sections and three Physics sections. How many ways can you complete your schedule if your only open 
periods are 9:30 and 11:30? 

C Exercise 

11. The definition of Q = [ajb \ a, b & b t 0} given in Chapter 1 is at best awkward, since, if we use the definition to list elements in Q, 

1-12 

we win have dupUcations, that is, -, — , - etc. Try to write a more precise definition of the rational numbers so that there is no dupUcation of 
elements. 
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I 2.4 Combinations and the Binomial Theorem 

COMBINATIONS 

In Section 2.1 we investigated the most basic concept in combinatorics, namely, the rule of products. Even though in this section we investigate 
other counting formulas, it is of paramount importance to keep this fundamental process in mind. In Section 2.2 we saw that a subclass of rule- 
of-products problems appears so frequently that we gave them a special designation, namely, permutations, and we derived a formula as a 
computational aid to assist us. In this section we will investigate another major subclass of the rule-of-product formula, namely, that given the 
name combinations. 

In many rule-of-products applications the permutation or order is important, as in the situation of the order of putting on one's socks and shoes; 
in some cases it is not important, as in placing coins in a vending machine or in the listing of the elements of a set. Order is important in 
permutations. Order is not important in combinations. 

Example 2.4.1. How many different ways are there to permute three letters from the set A = [a, b, c, d}l From the permutation 

41 

formula of Section 2.2 there are P (4; 3) = — '— = 24 different orderinss of three letters from A. 

^ ' (4-3)! ^ 

Example 2A2,. How many ways can we simply list, or choose, three letters from the set A = {a, b, c, d}7 Note here that we are not 
concerned with the order of the three letters. By trial and error, certainly abc, abd, acd, and bed are the only listings possible. A slightly more 
elegant approach would be to rephrase the question in terms of sets. What we are looking for is all three-element subsets of the set A. Order is 

not important in sets. The notation for choosing 3 elements from 4 is most commonly ^ ) "'^ occasionally C (4; 3), either of which is read "4 

choose 3" or the number of combinations for four objects taken three at a time. 

' n 



Definition: Binomial Coefficient. The binomial coefficient j^jof Cin; k) represents the number of combinations ofn objects taken k 
at a time, and is read "n choose k." 

We would now like to investigate the relationship between permutation and combination problems in order to derive a formula for ^ ^ j. Let us 

reconsider the above examples. There are 3 ! = 6 different orderings for each of the three-element subsets of A. Table 2.4.1 lists each subset of 
A and all permutations of each subset on the same line. 



3 - element subsets of A 


Permutations of each subset 


abc 


abc, acb, bca, bac , cab, cba 


abd 


abd, adb, bda, bad, dab, dba 


acd 


acd, adc, cda, cad, dac, dca 


bed 


bed, bdc, cdb, cbd, dbc, deb 



Hence 



■il)- 



Table 2.4.1 



: P(4; 3) = 4. We generalize this result in the following theorem: 



Theorem 2.4.1. If A is any finite set ofn elements, the number ofk-element subsets of A is: 

I n \ ^1 

where < k < n. 



Proof: There are k ! ways of ordering each of the k objects in any set of k elements. Therefore 



( 



1 „/• 7\ 1 »' 

— P(n, k) = —■ 



k\ (n-k)\ 



Alternate Proof: To "construct" a permutation of k objects from a set of n elements, we can first choose one of the subsets of objects and 
second, choose one of the k! permutations of those objects. By the rule of products. 



P(n\ k) ■■ 
and solving for ^ ^ j we get 



k\ 



\k I k\ ^ ' ' k\{n-k)< 

Example 2.43. Assume an evenly balanced coin is tossed five times. In how many ways can three heads be obtained? This is a combination 
problem, because the order in which the heads appear does not matter. The number of ways to get three heads is 

[3 3! (5-3)! 



-P{n-k)=^^ 



5x4 
2x1 



= 10 
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Example 2.4.4. Determine the total number of ways a fair coin can land if tossed five times. The four tosses can produce any one of the 
following mutually exclusive, disjoint events: 5 heads, 4 heads, 3 heads, 2 heads, 1 head, or heads. Hence by the law of addition we have: 



^'' + f^Uf^Uf^Uf^l=l+5 + 10+10 + 5 + l=32 



Of course, we could also have applied the extended rule of products, and since there are two possible outcomes for each of the five tosses, we 
have 2x2x2x2x2x2 = 2' = 32 ways. You might think that doing this counting two ways is a waste of time but solving a problem two 

« /n \ 

different ways often is instructive and leads to valuable insights. In this case, it suggests a general formula for the sum 2 . In the case of n 

= 5, we get 2^, so it is reasonable to expect that the general sum is 2", and it is. 

Example 2AS. A committee usually starts as an imstructured set of people selected from a larger membership. Therefore, a committee 

(25 \ 

can be thought of as a combination. If a club of 25 members has a five-member social committee, there are I ^ I ~ ^"^^ different possible 

social committees. If any structure or restriction is placed on the way the social committee is to be selected, the number of possible commit- 
tees will probably change. For example, if the club has a rule that the treasurer must be on the social committee, then the number of possibih- 

(2A\ 

ties is reduced to I ^ 1 = 10 626. If we further require that a chairperson other than the treasurer be selected for the social committee, we 

/ 24 \ /" 24 \ 

have I 4 I ><4 = 42504 different possible social committees. The choice of the four non-treasurers accoimts for the I ^ I and the choice of 

a chairperson accounts for the 4. 

Example 2AjS. There ^ ^ j = 1 way of choosing a combination of zero elements from a set of n, and there ^ ^ j = 1 way of 
choosing a combination of n elements from a set of n. 

THE BINOMIAL THEOREM 

The binomial theorem gives us a formula for expanding (x + y)" where n is a nonnegative integer. The coefficients of this expansion are 
precisely the binomial coefficients that we have used to count combinations. From high school algebra we can certainly compute (x + y)" for 
n = 0, 1, 2, 3, 4, 5 as given in the following table: 



{x + yf = l 


1 


(x + yY = X + y 


1 1 


(x + y)^ =x^ +2 y x + y^ 


1 2 1 


(x + ji)^ = JC-' + 3 >• jc^ + 3 _y'2 x + y^ 


13 3 1 


(x + y't = x'' ^-Ayx" + (3y^ £■ + x + y'^ 


1 4 6 4 1 


(x + y)^ =x' ^-Syx^ + lOy^j:^ + lOy^ j:^ + 5 / + 


1 5 10 10 5 1 



TABLE 1A2. 

In the expansion of {x + y'f we note that the coefficient of the third term is |^ ^ ) =10' that of the sixth term is ^ ^ j = 1 . We can rewrite 

the expansion as|^ + )-''''^"*^(2 j'^^'^'^^ls )-^''''^'''(4 )^"*'*'^(5 )-^' 
In summary, in the expansion of (x + yY we note: 

1 . The first term is jt" and the last term is . 

2. With each successive term, exponents of x decrease by 1 as those of y increase by 1 . For any term the sum of the exponents is n. 



3. The coefficient of x" j , the(^ + 1)"' term, is 



4. The triangular array of numbers in Table 2.4.2 is called Pascal's triangle after the seventeenth-century French mathematician Blaise Pascal. 
Note that each number in the triangle other than the I's at the ends of each row is the sum of the two numbers to the right and left of it in the 
row above. 

Theorem 2.4.2: The Binomial Theorem. Ifn >0, and x and y are numbers, then 

{x+yr= zf?l^'V 

k=o^ > 

This theorem wiU be proven using a procedure called mathematical induction, which wiU be introduced in Chapter 3. 
Example 2^.8. Find the third term in the expansion of (x - y)"^. Since (x - y)"^ = (x + {-y)) , the third term is 
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= x*-\-yf= 6x2/. 



Example 2.4.9. Expand (3 x - 2f . If we replace x and y in the Binomial Theorem with 3 x and -2, respectively you get 

• 3 



3 

I 

*-0 



(3x-2y = 2 I ^ |(3x)3-*(-2/ 



^ ) (3 xf (-If + ( 1 ) (3 (-2)1 + ( 2 ) (3 ^■)' (-2)' + ( 3 ) (3 -^)° ("2)' 



:27x-^'-54x2 + 36x- 



Mathematica Note 



Mathematica has a built-in function for binomial coefficients, Binomial. Unlike the examples we've concentrated on that can be done without 
technology, you can compute extremely large values. For example, a bridge hand is a 13 element subset of a standard 52 card deck — the order 

/ 52 ^ 

in which the cards come to the player doesn't matter. From the point of view of a single player, the number of possible bridge hands is I I, 

which is easily computed with Mathematica: 
Binomial [52, 13] 

635013559600 

In bridge, the location of a hand in relation to the dealer has some bearing on the game. An even truer indication of the number of possible 
hands takes into account each player's possible hand. It is customary to refer to bridge positions as West, North, East and South. We can apply 



the rule of product to get the total number of bridge hands with the following logic. West can get any of the |^ j hands identified above. Then 

/ 39 \ / 26 \ 

North get 13 of the remaining 39 cards and so has I I possible hands. East then get 13 of the 26 remaining cards, which has I ^-^ I possibili- 
ties. South gets the remaining cards. Therefore the number of bridge hands is 

Binomial[52, 13] Binomial [39, 13] Binomial [26, 13] 

53 644 737 765 488 792 839 237 440 000 

pFi^ Sage Note 

Sage will do the same calculations for bridge hands just as easily. From a command line interface the calculations look like this: 

sage: binomial(52,13) 

635013559600 

sage: binoinial(52,13)*binoinial(39,13)*binoinial(26,13) 

53644737765488792839237440000 

The syntax is different, but the results are the same. 

EXERCISES FOR SECTION 2.4 
A Exercises 

1. The judiciary committee at a college is made up of three faculty members and four students. If ten faculty members and 25 students have 
been nominated for the committee, how many judiciary committees are possible? 

2. Suppose that a single character is stored in a computer using eight bits. 

(a) How many bit patterns have exactly three 1 's? 

(b) How many bit patterns have at least two 1 's? 

3. How many subsets of 1 1 , . . . , 10) contain at least seven elements? 

4. The congressional committees on mathematics and computer science are made up of five congressmen each, and a congressional rule is that 
the two committees must be disjoint. If there are 385 members of congress, how many ways could the committees be selected? 

5. Expand (2x - ?,yf 
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6. Find the fourth terni of the expansion of{x — 2 -yf . 

7. (a) A poker game is played with 52 cards. How many "hands" of five cards are possible? 

(b) If there are four people playing, how many five-card "hands" are possible on the first deal? 

8. A flush in a five-card poker hand is five cards of the same suit. How many spade flushes are possible in a 52-card deck? How many flushes 
are possible in any suit? 

9. How many five-card poker hands using 52 cards contain exactly two aces? 

10. In poker, a fuU house is three-of-a-kind and a pair in one hand; for example, three fives and two queens. How many fuU houses are 
possible from a 52- card deck? 

11. A class of twelve computer science students are to be divided into three groups of 3, 4, and 5 students to work on a project. How many 
ways can this be done if every student is to be in exactly one group? 



13. There are ten points Pi , Pa. • • • > Pn on a plane, no three on the same line. 

(a) How many lines are determined by the points? 

(b) How many triangles are determined by the points? 

14. How many ways can n persons be grouped into pairs when n is even? Assume the order of the pairs matters, but not the order within the 
pairs. For example, if n = 4, the different groupings would be 

{{1,2), {3,4)1 {{3,4), {1,2)) 
{{1,3), {2,4)) {{2,4), {1,3)) 
{{1,4), {2, 3)) {{2,3), {1,4)) 

15. Use the binomial theorem to prove that if A is a finite set, | f{A) | = 2l^l . Hint: see Example 2.4.7. 

16. (a) A state's lottery involves choosing six different numbers out of a possible 36. How many ways can a person choose six numbers? 
(b) What is the probabihty of a person winning with one bet? 

17. Use the binomial theorem to calculate 9998' Note that 9998' = (10000 - if. 

18. Suppose two gamblers are playing poker and are dealt five cards each. Use technology to determine the number of possible ways the hands 
could be dealt. Assume, as in bridge, the order of the hands matters. 



B Exercises 



12. Explain in words why the following equaUties are true and then verify the equalities using the formula for binomial coefficients. 
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SUPPLEMENTARY EXERCISES FOR CHAPTER 2 

Section 2.1 

1. A university would like to detemiine how many different three-digit telephone extensions can be made using the digits 
0, 1,2,3,4,5,6, 7,8,9 

(a) if the three digits must be different 

(b) if the three digits can include duplications. 

2. A typical telephone number in most larger cities involves a seven-digit number using the digits 0, 1, 2, 9. The first three digits are 
referred to as the office code, and are frequently the same for smaller cities. 

(a) How many different telephone numbers can one create using only the last four digits? What if all four digits must be different? 

(b) How many different telephone numbers can be created using all seven digits? What if the first digit caimot be the number 0? 

3. (a) Let A = {a, b, c, d, eJandS = {1, 2, 3}. Draw a tree diagram similar to Figure 2.1.1 to illustrate A X B. 

(b) Let A and B be sets where \A \ =m and |B| = n. Use a tree diagram to prove |AxB| = |A| x |B|. 

4. In a business meeting involving four executives, each person shakes every other person's hand. How many handshakes will occur? 

5. How many ways can three couples be seated at a round table? (This problem may be harder than it seems.) 

(a) The first way to consider seating them is as each couple as a unit. 

(b) The second way is that the people in a couple may switch seats. How does this affect the seating plan? 

6. For the main course at a restaurant you have the choice of beef (prepared three ways), chicken (fried or broiled), or any one of two fish 
dishes. Baked or french fried potatoes and a choice of three vegetables complement the main course. Red or white wine or water are the only 
beverages served. However, the owner is fussy, and he will serve red wine or water only with meat dishes, and white wine or water only with 
the fish dishes. How many meals can be ordered? 

Section 2.2 

7. (a) How many ways are there to arrange six people in a circle? Two arrangements are to be considered the same if everyone has the same 
right-hand neighbor and the same left-hand neighbor regardless of the exact seat each person occupies. 

(b) How many ways are there to arrange six people in a circle if person pi cannot sit next to person p2? 

8. There are five roads from city A to city B and six roads from city B to city C. There are no direct routes from A to C. 

(a) How many different ways are there from city A to C? 

(b) How many different ways are there from A to C and back to A? 

(c) What is the answer to part b if each road is to be used exactly once? 

Section 2.3 

9. (a) A contest is entered by ten people. There will be four different prizes awarded among the ten people. No person wiU receive more than 
one prize. In how many ways can the prizes be distributed? 

(b) Suppose that instead of awarding four different prizes to the ten people it is decided that there will be four $10 prizes, three $7 prizes, 
and three $5 prizes. Suppose that each person will win a prize. In how many ways can the prizes be distributed? 

10. A real estate developer has finished constructing twelve houses and must paint them. He has purchased quantities of white and blue 
paint so that five of the houses will be blue and the remainder will be white. How many ways can he decide to paint the houses? 

11. Ten persons apply for three identical jobs and all are qualified for the positions. Two of these persons wiU work in New York and the other 
one wiU work in San Diego. How many ways can the positions be filled? 

Section 2.4 

12. Let A = {1, 2, 3, ... , 10}. 

(a) Determine the cardinality of f (A). 

(b) How many subsets of A contain exactly four elements? 

(c) How many subsets of A contain at least four elements? 

13. Among eleven senators, how many ways are there to select (a) a committee of five members; (b) a committee of five members so that a 
particular senator. Senator A, is always included; (c) a committee of five so that either Senator A or Senator B will be included? 
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14. Use the combination fomula to prove that 




15. (a) How many ways can three people be seated in a car with four seats? Assume that someone must drive. 

(b) What is your answer if only two of the three people have a driver's license? 

16. (a) Let T = 1 1 , 2, 3, 4, 5). How many subsets of T have less than four elements? 

(b) How many proper subsets of {1, 2, 3, 4, 5} contain the numbers 1 and 5? How many of them also do not contain the number 2? 
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chapter 3 




LOGIC 



GOALS 

In this chapter, we will introduce some of the basic concepts of mathematical logic. In order to fully understand some of the later concepts in 
this book, you must be able to recognize valid logical arguments. Although these arguments will usually be applied to mathematics, they 
employ the same techniques that are used by a lawyer in a courtroom or a physician examining a patient. An added reason for the importance of 
this chapter is that the circuits that make up digital computers are designed using the same algebra of propositions that we will be discussing. 
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PROPOSITIONS 

Definition: Proposition. A proposition is a sentence to which one and only one of the terms true or false can be meaningfully applied. 

Example 3.1.1. "Four is even," "4 e {1, 3, 5}," and "43 > 21" are propositions. 

In traditional logic, a declarative statement with a definite truth value is considered a proposition. Although our ultimate aim is to discuss 
mathematical logic, we won't separate ourselves completely from the traditional setting. This is natural because the basic assumptions, or 
postulates, of mathematical logic are modeled after the logic we use in everyday speech. Since compound sentences are frequently used in 
everyday speech, we expect that logical propositions contain connectives like the word and. The statement "Europa supports life or Mars 
supports life" is a proposition and, hence, must have a definite truth value. Whatever that truth value is, it should be the same as the truth value 
of "Mars supports life or Europa supports hfe." 

LOGICAL OPERATORS 

There are several ways in which we commonly combine simple statements into compound ones. The words/phrases and, or, not, if... then, and 
if and only i/can be added to one or more propositions to create a new proposition. To avoid any confusion, we will precisely define each one's 
meaning and introduce its standard symbol. With the exception of negation (not), all of the operators act on pairs of propositions. Since each 
proposition has two possible truth values, there are four ways that truth can be assigned to two propositions. In defining the effect that a logical 
operator has on two propositions, the result must be specified for all four cases. The most convenient way of doing this is with a truth table, 
which we will illustrate by defining the word and. 

Conjunction 

Definition: Conjunction (And). If p and q are propositions, their conjunction, p and q (denoted p A w defined by the truth table in 
Table 3.1.1. 



p 


1 


pAi 














1 





1 








1 


1 


1 



Table 3.1.1. Truth Table for And 

Notes: 

(a) To read this truth table, you must realize that any one line represents a case: one possible set of values for p and q. 

(b) The numbers and 1 are used to denote false and true, respectively. This is consistent with the way that many programming languages 
treat logical, or Boolean, variables since a single bit, or 1, can represent a truth value. Although Mathematica's logical expressions have a 
value of True or False, there is a built in function called Boole which converts the value to 1 or 0, if desired. 

{Boole [False] , Boole [True]} 

|0, 11 

(c) For each case, the symbol under p represents the truth value of p. The same is true for q. The symbol under p Al represents the truth value 
oi p l\q for that case. For example, the second row of the truth table represents the case in which p is false, q is true, and the resulting truth 
value for p /\q h false. As in everyday speech, p /\ q h true only when both propositions are true. 

(d) Just as the letters x, y, and z are frequently used in algebra to represent numeric variables, p, q, and r seem to be the most commonly used 
symbols for logical variables. When we say that p is a logical variable, we mean that any proposition can take the place of p. 

(e) One final comment: The order in which we Ust the cases in a truth table is standardized in this book. If the truth table involves two simple 
propositions, the numbers under the simple propositions can be inter preted as the two-digit binary integers in increasing order, 00, 01, 10, and 

11, for 0, 1,2, and 3. 

Disjunction 

Definition: Disjunction ( Or). Ifp and q are propositions, their disjunction is p or q, denoted p V q, and is defined by the truth table in 
Table 3.1.2. 

P 1 p\l q 



1 1 

1 1 
1 1 1 
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Table 3.12. Truth Table for Or 

Note; The only case in which disjunction is false is when both propositions are false. This interpretation of the word or is called the nonexclu- 
sive or. The exclusive or wiU be discussed when we consider logical design in Chapter 13. 

Negation 

Definition: Negation (Not). Ifp is a proposition, its negation, not p, is denoted -<p and is defined by the truth table in Table 3.1.3. 

P 

1 

1 

Table 3.13 Truth Table for Not 

Note: Negation is the only standard operator that acts on a single proposition; hence only two cases are needed. 
The Conditional Operator {If . . . then). 

Consider the following propositions from everyday speech: 

(a) I'm going to quit if 1 don't get a raise. 

(b) If 1 pass the final, then I'll graduate. 

(c) I'll be going to the movies provided that my car starts. 

All three propositions are conditional, they can all be restated to fit into the form if Condition, then Conclusion. For example, statement (a) can 
be rewritten as "If 1 don't get a raise, then I'm going to quit." 

A conditional statement is meant to be interpreted as a guarantee; if the condition is true, then the conclusion is expected to be true. It says no 
more and no less. 

Definition: Conditional Operator. The conditional statement ifp then q, denoted p q, is defined by the truth table in Table 3.1 .4. 



p 


q 










1 
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1 
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1 


1 



Table 3.1.4 Truth Table for //... then 

Example 3.1.2. Assume your instructor told you "If you receive a grade of 95 or better in the final examination, then you will receive an 
A in this course." Your instructor has made a promise to you (placed a condition with you). If you fulfill his condition you expect the 
conclusion (getting an A) to be forthcoming. Your graded final has been returned to you. Has your instructor told the truth (kept the promise) 
or is your instructor guilty of a falsehood? 

Case I: Your final exam score was less than 95 (the condition is false) and you did not receive an A (the conclusion is false). The instructor 
told the truth. 

Case 11: Your final exam score was less than 95, yet you received an A for the course. The instructor told the ttuth. (Perhaps your overall 
course average was excellent.) 

Case 111: Your final exam score was greater than 95, but you did not receive an A. The instructor lied. 

Case IV: Your final exam score was greater than 95, and you received an A. The instructor told the truth. 

To sum up, the only case in which a conditional proposition is false is when the condition is true and the conclusion is false. 

The order of the condition and conclusion in a conditional proposition is important. If the condition and conclusion are exchanged, a different 
proposition is produced. 

Definition: Converse. The converse of the proposition p-> q is the proposition q ->p. 

The converse of "If you receive a grade of 95 or better in the final exam, then you will receive an A in this course," is "If you receive an A in 
this course, then you received a grade of 95 or better in the final exam." It should be clear that these two statements say different things. 

Definition: Biconditional Operator (...if and only if...). If p and q are propositions, the biconditional statement "p if and only if q," 
denoted p<-> q, is defined by the truth table in Table 3.1.5. 
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p<^ 
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1 
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Table Truth table for "...if and only if..." 

Note that p ^ qis true when p and q have the same truth values. It is common to abbreviate "if and only if to "iff." 

Although . . then" and "if and only if are frequently used in everyday speech, there are several alternate forms that you should be aware of. 

They are summarized in the following list: 

Conditional 

If p then ^. 

p implies q. 

q follows from q. 

p, only if q. 

q,iip. 

p is sufficient for q. 

q is necessary for p. 

Biconditional 

p if and only if q. 

p is necessary and sufficient for q. 

p is equivalent to q. 

If p, then q, and if q, then p. 

If p, then q and conversely. 

EXERCISES FOR SECTION 3.1 
A Exercises 

1. Let d = "I like discrete structures" c = "I wiU pass this course" s = "I wiU do my assignments" Express each of the following propositions in 
symbolic form: 

(a) I like discrete structures and I wiU pass this course. 

(b) I win do my assignments or I wiU not pass this course. 

(c) It is not true that I like discrete structures and I wiU do my assignments. 

(d) I wiU not do my assignment and I wiU not pass this course. 

2. For each of the following propositions, identify simple propositions, express the compound proposition in symbolic form, and determine 
whether it is true or false: 

(a) The world is flat or zero is an even integer. 

(b) If 432,802 is a multiple of 4, then 432,802 is even. 

(c) 5 is a prime number and 6 is not divisible by 4. 

(d) 3 e Z and 3 e Q. 

(e) 2/3 e Zand 2/3 e Q. 

(f) The sum of two even integers is even and the sum of two odd integers is odd. 

3. Let p = "2 < 5," q = "S is an even integer," and r = "11 is a prime number." Express the following as a statement in English and determine 
whether the statement is true or false: 

(a) - p V ? 

Qo) p^q 

(c) (p Aq)-^r 

(d) p ^ <? V (- r) 

(e) p ^ (- V (- 
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(f)-iq^-ip 

4. Rewrite each of the following statements using the other conditional forms: 

(a) If an integer is a multiple of 4, then it is even. 

(b) The fact that a polygon is a square is a sufficient condition that it is a rectangle. 

(c) If X = 5, then jc^ = 25. 

(d) Ifjc2 - 5jc + 6 = 0,thenjc = 2orjc = 3. 

(e) = is a necessary condition for x = y. 

5. Write the converse of the propositions in exercise 4. Compare the truth of each proposition and its converse. 
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I 3.2 Truth Tables and Propositions Generated by a Set 

Consider the compound proposition c = (p /\ q)\/ q /\ r), where p, q, and r are propositions. This is an example of a proposition gener- 
ated by p, q, and r. We will define this terminology later in the section. Since each of the three simple propositions has two possible truth 
values, it follows that there are eight different combinations of truth values that determine a value for c. These values can be obtained from a 
truth table for c. To construct the truth table, we build c from p, q, and r and from the logical operators. The result is Table 3.2.1. Strictly 
speaking, the first three columns and the last column make up the truth table for c. The other columns are work space needed to build up to c. 
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Table 3.2.1 Truth Table for c = (/> A ^r) V (-■ « A r) 

Note that the first three columns of the truth table are an enumeration of the eight three-digit binary integers. This standardizes the order in 
which the cases are listed. In general, if c is generated by n simple propositions, then the truth table for c will have 2" rows with the first n 
columns being an enumeration of the n digit binary integers. In our example, we can see at a glance that for exactly four of the eight cases, c 
will be true. For example, if p and r are true and q is false (the sixth case), then c is true. 

Let S be any set of propositions. We will give two definitions of a proposition generated by S. The first is a bit imprecise, but should be clear. 
The second definition is called a recursive definition. If you find it confusing, use the first definition and return to the second later. 

Definition: Proposition Generated by S. 

(1) A proposition generated by S is any valid combination of propositions in S with conjunction, disjunction, and negation . 

(2) (a) If p e S, then p is a proposition generated by S. 

(b) Ifx and y are propositions generated by S, then so are (x), -ix,x\J y , and x /\y. 

Note: We have not included the conditional and biconditional in the definition because they can both be obtained fi-om conjunction, disjunction, 

and negation, as we will see later. 

If 5 is a finite set, then we may use slightly different terminology. For example, if 5 = Ip, q, r], we might say that a proposition is generated 
by ^, and r instead of {p, q, r). 

Hierarchy of Logical Operations 

It is customary to use the following hierarchy for interpreting propositions, with parentheses overriding this order: 
First: Negation 
Second: Conjunction 
Third: Disjunction 

Within any level of the hierarchy, work from left to right. Using these rules, p /\ q \/ r is taken to mean A V '"• These precedence rules 
are universal, and are exactly those used by computer languages to interpret logical expressions. 

Example 3.2.1. A few shortened expressions and their fuUy parenthesized versions: 

(a) P A q A ris(p A q) A r. 

(b) V -ris(-p) V (-r). 

(c) -. -. p is (-. p). 

A proposition generated by a set S need not include each element of S in its expression. For example, -r ^ A '" is a proposition generated by p, 
q, and r. 

EXERCISES FOR SECTION 3.2 
A Exercises 

1 . Construct the truth tables of: 
(a) pVp 
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(b) pA(-p) 

(c) pV(-p) 

(d) p Ap 

2. Construct the truth tables of: 

(a) -(pA 

(b) P A (- 

(c) (p Aq)Ar 

(d) (p A?)Vte A'-)V('- A P) 

(e) ^ pM 

(f) pV^VrVi 

3. Rewrite the following with as few extraneous parentheses as possible: 

(a) (-((p) A W)) V is) 

(b) ((p) V iq)) A (('•) V (g)) 

4. In what order are the operations in the following propositions performed? 

(a) p\J ^q\J rA^P 

(b) pA^qArA^P 

5. Determine the number of rows in the truth table of a proposition containing four variables p, q, r, SLtids. 

6. If there are 45 lines on a sheet of paper, and you want to reserve one line for each line in a truth table, how large could | 5 | be if you can 
write truth tables of propositions generated by S on a sheet of paper? 
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Tautologies & Contradictions 

Consider two propositions generated by p and q: ^(p /\ q) and -• p \/ ^ q. Al first glance, they are different propositions. In form, they are 
different, but they have the same meaning. One way to see this is to substitute actual propositions for p and q; such as: 

p: I've been to Toronto; and q: I've been to Chicago. 

Then -^(p t\ q) translates to "I haven't been to both Toronto and Chicago," while p \J ^ q.i% "I haven't been to Toronto or I haven't been to 
Chicago." Determine the truth values of these propositions. Naturally, they will be true for some people and false for others. What is important 
is that no matter what truth values they have, -•ip A l) and -> p V ~'<1- will have the same truth value. The easiest way to see this is by 
examining the truth tables of these propositions (). 
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Table 33.1 Truth tables of -. (p A ?) and -. /> V -■ ?■ 

In all four cases, ->{p f\ q) and -< p V have the same truth value. Then when the biconditional operator is applied to them, the result is a 

value of true in all cases. 

Definition: Tautology . An expression involving logical variables that is true in all cases is called a tautology. 
Example 3.3.1. All of the following are tautologies because their truth tables consist of a column of I's. 

(a) (^(p A?))^(- p\J ^q). 

(b) py^p 

(c) (p Aq)^ p 

(d) q^(p\lq) 

(e) ip \l q)^{q \lp) 

Definition: Contradiction. An expression involving logical variables that is false for all cases is called a contradiction. 
Example 33.2. p A ~' P and (p V A p) A q) are contradictions. 

Equivalence 

Definition: Equivalence. Let S he a set of propositions and let r and s be propositions generated by S. r and s are equivalent if and only 
if r <r^ s is a tautology. The equivalence ofr and s is denoted r s. 

Example 333. The following are all equivalences: 

(a) (p A?) V (-P Aq)<^q- 

(b) p-^q^-^q^-ip 

(c) p V q^q \l p. 

All tautologies are equivalent to one another. We wiU use the number I to symbolize a tautology. 

Example 3.3.4. p\/ ^ p^l. 
All contradictions are equivalent to one another. We wiU use the number to symbolize a contradiction. 

Example 3.3.5. p /\ -. p <^ 0. 

Equivalence is to logic what equality is to algebra. lust as there are many ways of writing an algebraic expression, the same logical meaning 
can be expressed in many different ways. 

IMPLICATIO^ 

Example 33.6. Consider the two propositions: 
x: The money is behind Door A; and 
y: The money is behind Door A or Door B . 
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Imagine that you were told that there is a large sum of money behind one of two doors marked A and B, and that one of the two propositions 
X and y is true and the other is false. Which door would you choose? All that you need to realize is that if x is true, then y will also be true. 
Since we know that this can't be the case, y must be the true proposition and the money is behind Door B . 

This is an example of a situation in which the truth of one proposition leads to the truth of another. Certainly, y can be true when x is false; but x 
can't be true when y is false. In this case, we say that x impUes y. 

Look at the truth table of p -> in Table 3.1 .4 (copied below). If p impUes q, then the third case can be ruled out, since it is the case that makes 
a conditional proposition false. 
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Definition: ImpUcation. Let She a set of propositions and let r and s be propositions generated by S. We say that r implies sifr-^s is a 
tautology. We write r => sto indicate this implication. 

Example 33.7. A commonly used impUcation is that p impUes p \/ q, which is verified by the truth table in Table 3.3.2. 

P 1 p\J q P^p\lq 
1 
Oil 1 
10 1 1 
111 1 

Table 33.2 Truth Table for p ^ {p y q) 

If we let p represent "The money is behind Door A" and q represent "The money is behind Door B," p => (p \/ q) is a formalized version of 
the reasoning used in Example 3.3.6. A common name for this implication is disjunctive addition. In the next section we wiU consider some 
of the most commonly used impUcations and equivalences. 

When we defined what we mean by a proposition generated by a set in Section 3.2, we didn't include the conditional and biconditional 
operators. This was because of the two equivalences p^q<^^p\lq and p q <^ {p f\ q) \J (-^ p /\ ^ q). Therefore, any proposi- 
tion that includes the conditional or biconditional operators can be written in an equivalent way using only conjunction, disjimction, and 
negation. We could even dispense with disjunction since p y qis equivalent to a proposition that uses oi3y conjunction and negation. 

EXERCISES FOR SECTION 3.3 
A Exercises 

1 . Given the following propositions generated by p,q, and r, which are equivalent to one another? 

(a) ip hr)\l q 

(b) pViryq) 

(c) r A P 

(d) - r V P 

(e) ipyq)A(r\J q) 

(f) r^ p 

(g) r \J ^p 

(h) p^r 

2. (a) Construct the truth table for ;<; = (j> t\ ^ q) \l (r t\ p). 

(b) Give an example other than x itself of a proposition generated by p, q, and r that is equivalent to x. 

(c) Give an example of a proposition other than x that impUes x. 

(d) Give an example of a proposition other than x that is impUed by x. 

3. Is an implication equivalent to its converse? Verify your answer using a truth table. 

4. Suppose that xisa proposition generated by p, q, and r that is equivalent to p y ->q. Write out the truth table for x. 

B Exercises 

5. How large is the largest set of propositions generated by p and q with the property that no two elements are equivalent? 
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6. Find a proposition that is equivalent to p \/ q and uses only conjunction and negation. 

7. Explain why a contradiction implies any proposition and any proposition implies a tautology. 
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I 3.4 The Laws of Logic 

In this section, we will list the most basic equivalences and implications of logic. Most of the equivalences listed in Table 3.4.1 should be 
obvious to the reader. Remember, stands for contradiction, 1 for tautology. Many logical laws are similar to algebraic laws. For example, 
there is a logical law corresponding to the associative law of addition, a + {b + c) = (a + b) + c. In fact, associativity of both conjunction 
and disjunction are among the laws of logic. Notice that with one exception, the laws are paired in such a way that exchanging the symbols A, 
V, 1 and for V, A, 0, and 1, respectively, in any law gives you a second law. For example, p \J (=> p results in p A 1 <=> p. This called a 
duality principle. For now, think of it as a way of remembering two laws for the price of one. We will leave it to the reader to verify a few of 
these laws with truth tables. However, the reader should be careful in applying duality to the conditional operator and implication since the dual 
involves taking the converse. For example, the dual oipf\q^p\ip\Jq<^p, which is usually written p => p \J q 

Example 3.4.1. The identity law: 

P 1 pAl (PAI)^P 
10 1 
111 1 

therefore, {p /\ I) p. 

Some of the logical laws in Table 3.4.2 might be less obvious to you. For any that you are not comfortable with, substitute actual propositions 
for the logical variables. For example, if p is "John owns a pet store" and q is "John likes pets," the detachment law should make sense. 

TABLE 3.4.1 Basic Logical Laws 

Commutative Laws 

pX/q'^qX/p p Aq'i^q Ap 
Associative Laws 

{p\J q)\Jr ^ p\J (q\J r) (p Aq) Ar p A (q A r) 

Distributive Laws 

p Aiqy r)<^{p Aq )W ip Ar) p V iq A r) <^ {p W q) A (p W r) 

Identity Laws 
p y 0^ p p A \ ^ p 

Negation Laws 
p A^p^O py 1 

Idempotent Laws 
p \J p ^ p p Ap <^ p 

Null Laws 
pAO«.0 pVl<«l 
Absorption Laws 
p A\p\J q) <^ p p\J (p A q) ^ p 
DeMorgan's Laws 
- (p V « (- P) A (- q) - (P A <?) « (- P) V (- q) 

Involution Law 

-, (-, p) p 
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Detachment 

ip ^ q) A q 

Indirect Reasoning 

ip^q)A^q^^p 

Disjunctive Addition 
p^ipy q) 

Conjunctive Simplification 
iP A q) ^ P and (p f\ q) ^ q 
Disjunctive Simplification 
(p V <?) A P ^ <? and (p V A g ^ P 
Chain Rule 
[p ^ q) A (q ^ r) ^ (p ^ r) 

Conditional Equivalence 

Biconditional Equivalences 
(p ^ q) ^ (p^q) A iq ^ p)^ ip A <?) V P A ""?) 
Contrapositive 

ip^q) <=> i^q ^^p) 

EXERCISES FOR SECTION 3.4 
A Exercises 

1 . Write the following in symbolic notation and determine whether it is a tautology: "If I study then I will learn. I will not learn. Therefore, I do 
not study." 

2. Show that the common fallacy (p -> q) /\ ^ p ^ q is not a law of logic. 

3. Describe in general how duality can be applied to implications if we introduce the symbol read "is implied by." 

4. Write the dual of the following statements: 

(a) (p f\q)^ p 

(b) {p\/q)A^q^P 
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I 3.5 Mathematical Systems 

In this section, we present an overview of what a mathematical system is and how logic plays an important role in one. The axiomatic method 
that we will use here will not be duplicated with as much formality anywhere else in the book, but we hope an emphasis on how mathematical 
facts are developed and organized will help to unify the concepts we will present. The system of propositions and logical operators we have 
developed will serve as a model for our discussion. Roughly, a mathematical system can be defined as follows: 

Definition: Mathematical System. A mathematical system consists of: 

(1) A set or universe, U. 

(2) Definitions — sentences that explain the meaning of concepts that relate to the universe. Any term used in describing the universe itself is 
said to be undefined. All definitions are given in terms of these undefined concepts of objects. 

(3) Axioms — assertions about the properties of the universe and rules for creating and justifying more assertions. These rules always 
include the system of logic that we have developed to this point. 

(4) Theorems — the additional assertions mentioned above . 

Example 3.5.1. In Euclidean geometry the universe consists of points and lines (two undefined terms). Among the definitions is a 
definition of parallel lines and among the axioms is the axiom that two distinct parallel lines never meet. 

Example 3.5.2. Propositional calculus is a formal name for the logical system that we've been discussing. The universe consists of 
propositions. The axioms are the truth tables for the logical operators and the key definitions are those of equivalence and implication. We 
use propositions to describe any other mathematical system; therefore, this is the minimum amount of structure that a mathematical system 
can have. 

Definition: Theorem. A true proposition derived from axioms of mathematical system is called a theorem. 

Theorems are normally expressed in terms of a finite number of propositions, pi , p2, . . . , p„ , called the premises, and a proposition, C, called 
the conclusion. These theorems take the form 

PiA PiA ■■■ Ap„^c 

or more informally, 

Pi, p2, . . . , and p„ imply C 

For a theorem of this type, we say that the premises imply the conclusion. When a theorem is stated, it is assumed that the axioms of the system 
are true. In addition, any previously proven theorem can be considered an extension of the axioms and can be used in demonstrating that the 
new theorem is true. When the proof is complete, the new theorem can be used to prove subsequent theorems. A mathematical system can be 
visualized as an inverted pyramid with the axioms at the base and the theorems expanding out in various directions (Figure 3.5.1). 




FIGURE 3.5.1 The body of knowledge In a mathematical system 

PROOF 

Definition: Proof. A proof of a theorem is a finite sequence of logically valid steps that demonstrate that the premises of a theorem imply 
the conclusion. 

Exactly what constitutes a proof is not always clear. For example, a research mathematician might require only a few steps to prove a theorem 
to a colleague, but might take an hour to give an effective proof to a class of students. Therefore, what constitutes a proof often depends on the 
audience. But the audience is not the only factor. One of the most famous theorems in graph theory. The Four Color Theorem, was finally 
proven in 1976, after over a century of effort by many mathematicians. Part of the proof consisted of having a computer check many different 
graphs for a certain property. Without the aid of the computer, this checking would have taken years. In the eyes of some mathematicians, this 
proof was considered questionable. Shorter proofs have been developed since 1976 and there is no controversy associated with The Four Color 
Theorem at this time. (The theorem is stated in Chapter 9.) 

PROOFS IN PROPOSITIONAL CALCULUS 

Theoretically, you can prove anything in propositional calculus with truth tables. In fact, the laws of logic stated in Section 5.4 are all theorems. 
Propositional calculus is one of the few mathematical systems for which any valid sentence can be determined true or false by mechanical 
means. A program to write truth tables is not too difficult to write; however, what can be done theoretically is not always practical. For example, 

a, a^ b, 6-> c, . . . , y ^ z => z 
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is a theorem in prepositional calculus. However, suppose that you wrote such a program and you had it write the truth table for 

(a A (a ^ b) Aib ^ c) A ■■■ A(y ^ z)) ^ z 

The truth table wiU have 2^* cases. At one mUlion cases per second, it would take approximately one minute to verify the theorem. Now if you 
decided to check a similar theorem. 

Pi, Pi^ P2, ■■■ , P99 Pm =>Pioo. 

you would really have time trouble. There would be 2^"*' » 1 .26765 x 10^" cases to check in the truth table. At one miUion cases per second it 
would take approximately 1 .46719x 10^' days to check all cases. For most of the remainder of this section, we will discuss an alternate method 
for proving theorems in prepositional calculus. It is the same method that we wiU use in a less formal way for proofs in other systems. Formal 
axiomatic methods would be too unwieldy to actually use in later sections. However, none of the theorems in later chapters would be stated if 
they couldn't be proven by the axiomatic method. 

We will introduce two types of proof here, direct and indirect. 
DIRECT PROOFS 

A direct proof is a proof in which the truth of the premises of a theorem are shown to directly imply the truth of the theorem's conclusion. 
Example 3 Theorem: p^r, q^s,p\/q=^s\/r. A direct proof of this theorem is: 



Step 


Proposition 


Justification 


(1) 


pVci 


Premise 


(2) 


^p ^ q 


(1), conditional rule 


(3) 


q s 


Premise 


(4) 


-1 p s 


(2), (3), chain rule 


(5) 


-^s ^ p 


(4), contrapositive 


(6) 


p ^ r 


Premise 


(7) 


i -> r 


(5), (6), chain rule 


(8) 


syr 


(7), conditional rule ■ 



Note that ■ marks the end of a proof. 

Rules for Formal Proofs. Example 3.5.3 illustrates the usual method of formal proof in a formal mathematical system. The rules governing 

these proofs are: 

(1) A proof must end in a finite number of steps. 

(2) Each step must be either a premise or a proposition that is implied from previous steps using any valid equivalence or implication. 

(3) For a direct proof , the last step must be the conclusion of the theorem. For an indirect proof (see below), the last step must be a 
contradiction. 

(4) Justification Column. The column labeled "justification" is analogous to the comments that appear in most good computer programs. They 
simply make the proof more readable. 

Example 3SA. Here are two direct proofs of -> p y q, s\/ p, ->q s: 



(1) -< p y q Premise 

(2) -1 q Premise 

(3) -I p Disjunctive simplification, (1), (2) 

(4) sy p Premise 

(5) s Disjunctive simplification, (3), (4). ■ 
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You are invited to justify the steps in this second proof: 

(1) ^pW q 

(2) -■q ^ -'P 

(3) s\/p 

(4) py s 

(5) -^s 

(6) -I q ^ s 

(7) ^q 

(8) i ■ 

CONDITIONAL CONCLUSIONS 

The conclusion of a theorem is often a conditional proposition. The condition of the conclusion can be included as a premise in the proof of the 
theorem. The object of the proof is then to prove the consequence of the conclusion. This rule is justified by the logical law 
p (h ^ c) (p /\ h) c. 

Example 3.5S. The following proof of p -» (q -» s), --r V p, q => r -» s includes r as a fourth premise. The truth of s concludes the proof. 



(1) 


-.ryp 


Premise 


(2) 


r 


Added premise 


(3) 


P 


(1), (2), disjunction simplification 


(4) 


p ^ (q ^s) 


Premise 


(5) 


q s 


(3), (4), detachment 


(6) 


1 


Premise 


(7) 


s 


(5), (6), detachment. ■ 



INDIRECT PROOFS / PROOF BY CONTRADICTION 

Consider a theorem P C, where P represents pi, p2, . . . , and p„ , the premises. The method of indirect proof is based on the equivalence 

p^c^^iPA ^ c). 

In words, this logical law states that if P => C, then P A C is always false; that is, P A C is a contradiction. This means that a valid 
method of proof is to negate the conclusion of a theorem and add this negation to the premises. If a contradiction can be implied from this set of 
propositions, the proof is complete. For the proofs in this section, a contradiction will often take the form f A f. For proofs involving 
numbers, a contradiction might be 1 = or < 0. Indirect proofs involving sets might conclude with a: e or (x e A and x e A'). Indirect 
proofs are often more convenient than direct proofs in certain situations. Indirect proofs are often called proofs by contradiction. 

Example 3.5.6. Here is an example of an indirect proof of the theorem in Example 3.5.3: 



(1) 


^ (i V r) 


Negated conclusion 


(2) 


-1 J A -1 r 


DeM organ's Law, (1) 


(3) 


-1 s 


Conjunctive simplification, (2) 


(4) 


q-^ s 


Premise 


(5) 




Indirect reasoning, (3), (4) 


(6) 


-1 r 


Conjunctive simplification, (2) 


(7) 


p ^ r 


Premise 


(8) 


-^P 


Indirect reasoning, (6), (7) 


(9) 


(- P) A (- q) 


Conjunctive, (5), (8) 


(10) 


--{pM q) 


DeMorgan's Law, (9) 


(11) 


P V q 


Premise 


(12) 





(10),(11)B 
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PROOF STYLE 

The rules allow you to list the premises of a theorem immediately; however, a proof is much easier to follow if the premises are only listed 
when they are needed. 

Example 3.5.7. Here is an indirect proof of a -^b,-i{b\lc)=^->a. 

(1) a Negation of the conclusion 

(2) a -» Premise 

(3) b (1), (2), detachment 

(4) b y c (3), disjunctive addition 

(5) ->(b y c) Premise 

(6) (4),(5)B 

As we mentioned at the outset of this section, we are only presenting an overview of what a mathematical system is. For greater detail on 
axiomatic theories, see StoU (1961). An excellent description of how propositional calculus plays a part in artificial intelligence is contained in 
Hofstadter (1980). If you enjoy the challenge of constructing proofs in propositional calculus, you should enjoy the game WFF'N PROOF 
(1962), by L.E. Allen. 



EXERCISES FOR SECTION 3.5 
A Exercises 

1 . Prove with truth tables: 
{a)py q, ^q^ p 
Q3)p^q, -'q=>-'P 

2. Prove with truth tables: 

(a) ^, ^q^ p 
{h)p^q^^pyq 
B Exercises 

3. Give direct and indirect proofs of: 

{&) a ^ b, c ^ b, d ^ (a M c), d ^ b. 

(b) (p^q) A(r^ s), (q ^t) A (s u), -• (t A u), p ^ r ^ ^ p. 

(c) p ^ (q ^ s) , sy p , q ^ s -> r . 

(d) p q, q ^ r, ^ip A r), p y r ^ r. 

(e) ->q, p ->q, py t => f 

4. Give direct and indirect proofs of: 

(a) p q, -'r -iq, -i r => p. 

(b) p -I q, ^ r q, p r. 

(c) a y b, c A d, a ^ -> c b. 

5. Are the following arguments valid? If they are valid, construct formal proofs; if they aren't valid, explain why not. 

(a) If wages increase, then there wiU be inflation. The cost of living wiU not increase if there is no inflation. Wages wiU increase. Therefore, the 
cost of living will increase. 

(b) If the races are fixed or the casinos are crooked, then the tourist trade will decline. If the tourist trade decreases, then the police wiU be 
happy. The police force is never happy. Therefore, the races are not fixed. 

6. Determine the validity of the following argument: For students to do well in a discrete mathematics course, it is necessary that they study 
hard. Students who do well in courses do not skip classes. Students who study hard do well in courses. Therefore students who do well in a 
discrete mathematics course do not skip class. 

7. Describe how pi, pi -> P2, ■■■ , P99 -» Pioo =^ Pioo could be proven in 199 steps. 
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I 3.6 Propositions over a Universe 

Example 3.6.1. Consider the sentence "He was a member of the Boston Red Sox." There is no way that we can assign a truth value to 
this sentence unless "he" is specified. For that reason, we would not consider it a proposition. However, "he" can be considered a variable that 
holds a place for any name. We might want to restrict the value of "he" to all names in the major-league baseball record books. If that is the 
case, we say that the sentence is a proposition over the set of major-league baseball players, past and present. 

Definition: Proposition over a Universe. Let (7 be a nonempty set. A proposition over U is a sentence that contains a variable that can take on 
any value in U and that has a definite truth value as a result of any such substitution. 

Example 3.6.2. 

(a) A few propositions over the integers are4j;^ - 3x = 0, < n <5, and "k is a multiple of 3." 

(b) A few propositions over the rational numbers are 4 - 3x = 0, y^ = 2,md (j - l)(j + 1) = - 1. 

(c) A few propositions over the subsets of IP are (A =0) V (A = IP),3 e A, and A f] {1, 2, 3} 0. 

All of the laws of logic that we listed in Section 3.4 are valid for propositions over a universe. For example, if p and q are propositions over the 
integers, we can be certain that p /\ q ^ p, because (p f\ q) ^ p'la & tautology and is true no matter what values the variables in p and q are 
given. If we specify p and qiohe, p(n) : n < 4 and q(n) : n < 8, we can also say that p implies p f\ q. This is not a usual impUcation, but for 
the propositions under discussion, it is true. One way of describing this situation in general is with truth sets. 

TRUTH SETS 

Definition: Truth Set. If p is a proposition over U, the truth set of p is Tp = [a e U \ p (a) is true]. 

Example 3.63. The truth set of the proposition {1, 2) 0^ = taken as a proposition over the power set of {1, 2, 3, 4) is 
{0, {3}, {4}, {3, 4}}. 

Example 3.6.4. In the universe Z (the integers), the truth set of 4 - 3 x = is |0}. If the universe is expanded to the rational numbers, 
the truth set becomes {0, 3/4). The term solution set is often used for the truth set of an equation such as the one in this example. 

Definition: Tautology and Contradiction. A proposition over U is a tautology if its truth set is U. It is a contradiction if its truth set is 

empty. 

Example 3.6.5. - 1) -l- 1) = - 1 is a tautology over the rational numbers, - 2 = is a contradiction over the rationals. 
The truth sets of compound propositions can be expressed in terms of the truth sets of simple propositions. For example, if a e T'pA?' ^^^^ ^ 
makes p l\ q true. Therefore, a makes both p and q true, which means that a e Tp fl T^. This explains why the truth set of the conjunction of 
two propositions equals the intersection of the truth sets of the two propositions. The following Ust summarizes the connection between 
compoimd and simple truth sets: 

-^pA« = n 

Tp\Jq = Tp\jTq 

T^p = Tp-= 

7'p«, = (7',nr,)U(r/nr/) 

T =T '^WT 
^ p-*q ^ p ^ ^ q 

Definition: Equivalence. Two propositions are equivalent if p <-> q is a tautology. In terms of truth sets, this means that p and q are 
equivalent ifTp = Tq . 

Example 3.6.6. 

(a) n + 4 = 9 and n = 5 are equivalent propositions over the integers. 

(b) A n |4) t 0and4 e A are equivalent propositions over the power set of the natural numbers. 
Definition: Implication. Ifp and q are propositions over U, p implies q ifp -> q is a tautology. 

Since the truth set of p -> g is T^'' IJ T^, the Venn diagram for Tp^^ in Figure 3.6.1 shows that p ^ q when Tp cTq. 
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HGURE 3.6.1 Venn diagram for Tp^, 



Example 3.6.7. 

(a) Over the natural numbers: n < 4 ^ n < 8 since |0, 1, 2, 3, 4| c (0, 1, 2, 3, 4, 5, 6, 7, 8|. 

(b) Over the power set of the integers: | A' | =1 implies A H |0. 1| + 0- 

(c) A c even integers => A H odd integers =0. 

EXERCISES FOR SECTION 3.6 
A Exercises 

1 . If (7 = Piil, 2, 3, 4)), what are the truth sets of the following propositions? 

(a) A n |2, 4) = 0. 

(b) 3 e A and 1 ^ A. 

(c) A U lU = A. 

(d) A is a proper subset of |2, 3, 4). 

(e) |A| = |A'|. 

2. Over the universe of positive integers, define 

p(n) : n is prime and « < 32. 
q(n) : n\i & power of 3. 
r (n) : n is a divisor of 27. 

(a) What are the truth sets of these propositions? 

(b) Which of the three propositions implies one of the others? 

3. If f/ = (0, 1,2), how many propositions over U could you list without listing two that are equivalent? 

4. Given the propositions over the natural numbers: 

p : n < A 

q : In > \1 

r : « is a divisor of 1 8 
what are the truth sets of: 
(a) q 
(h) pAq 

(c) r 

(d) q^r 

5. Suppose that i is a proposition over {1, 8} . If = |1, 3, 5, 7), give two examples of propositions that are equivalent to ^. 

6. (a) Determine the truth sets of the following propositions over the 
positive integers: 

p{n) : « is a perfect square and « < 100. 
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q{n) : n = \ P{A) \ for some set A 
(b) Detemiine T^/^^ for p and q above. 

7. Let the universe be Z, the set of integers. Which of the following propositions are equivalent over Z? 

a: 0<n^ <9. 
b: 0<n^ <27. 
c: < n < 3. 
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I 3.7 Mathematical Induction 



In this section, we will examine mathematical induction, a technique for proving propositions over the positive integers. Mathematical (or 
finite) induction reduces the proof that all of the positive integers belong to a truth set to a finite number of steps. 

Mathematical Induction is sometimes called finite induction. 

Example 3.7.1. Consider the following proposition over the positive integers, which we will label p (n): The sum of the positive integers 



from 1 to n is 



n{n+\) 



This is a well-known formula that is quite simple to verify for a given value of n. For example, p (5) is: The sum of the 



positive integers from 1 to 5 is 



5(5+1) 



Indeed, 1+2 + 3+ 4 + 5=15 



5(5+1) 



Unfortunately, this doesn't serve as a proof that p (n) is a 



2 2 

tautology. All that we've established is that 5 is in the truth set of p. Since the positive integers are infinite, we certainly can't use this 
approach to prove the formula. 

An Analogy: Mathematical induction is often useful in overcoming a problem such as this one. A proof by mathematical induction is similar to 
knocking over a row of closely spaced dominos that are standing on end. To 

knock over the five dominos in Figure 3.7.1, all you need to do is push Domino 1 to the right. To be assured that they all will be knocked over, 
some work must be done ahead of time. The dominos must be positioned so that if any domino is pushed to the right, it will push the next 
domino in the line. 



FIGURE 3.7.1 Illustration of example 3.7.1 

Now imagine the propositions p(l), p (2), p(3), ... to be an infinite line of dominos. Let's see if these propositions are in the same formation 
as the dominos were. First, we will focus on one specific point of the line: p{99) and p(lOO). We are not going to prove that either of these 
propositions is true, just that the truth of p(99) implies the truth of p (100). In terms of our analogy, if p(99) is knocked over, it will knock over 
p(lOO). 

In proving p(99) => p(lOO), we will use p(99) as our premise. We must prove: The sum of the positive integers from 1 to 100 is I2!H^?2±il ^/e 

start by observing that the sum of the positive integers from 1 to 100 is (1 + 2 + ••• + 99) + 100. That is, the sum of the positive integers 
from 1 to 100 equals the sum of the first ninety-nine plus the final number, 100. We can now apply our premise, p{99), to the sum 
1 + 2 + ••• + 99. After rearranging our numbers, we obtain the desired expression for 1 + 2 + ••• + 100: 

1 + 2 + ••• + 99 + 100 =(1 + 2 + ••• + 99) + 100 



99 (99+1) 
2 

99 • 100 



+ 100 

2 y 100 



_ 100(100+1) 
~ 2 

What we've just done is analogous to checking two dominos in a line and finding that they are properly positioned. Since we are dealing with an 
infinite line, we must check all pairs at once. This is accomplished by proving that p{n) => p{n + 1) for all n > 1: 

1 + 2 + ••• + « + (n + 1) = (1 + 2 + ••• + n) + (n + 1) 

nin+l) 



1 

nin+l) 



+ (n + I) 

^ 2(n+l) 



by p{n) 



(n+l)(n+2) 



(i!+l)((n+l)+l) 



They are all lined up! Now look at p(l): The sum of the positive integers from 1 to 1 is — . Clearly, p(l) is true. This sets off a chain reaction. 
Since p(l) => p (2), p (2) is true. Since p (2) => p (3), p(3) is true; and so on. ■ 

The Principle of Mathematical Induction. Let p(n) be a proposition 
over the positive integers, then p(n) is a tautology if 

(1) p(l) is true, and 

(2) for all n > I, p(n) p(n + 1). 

Note: The truth of p (1) is called the basis for the induction proof. The premise that p(n) is true in Statement (b) is called the induction hypothe- 
sis. The proof that p(n) implies p(n + 1) is called the induction step of the proof. Despite our analogy, the basis is usually done first in an 
induction proof. The order doesn't really matter. 
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Example 3.7.2. Consider the implication over the positive integers 

pin): qo qi, qi q2 qn-i qn, ?o => ?n 

A proof that p (n) is a tautology follows. 

Basis: pil) is qo qi, qo=^ qi - This is the logical law of detachment which we know is true. If you haven't done so yet, write out the truth 
table of ((^0 -» ?i ) A qo) ^ qi to verify this step. 

Induction: Assume that p (n) is true for some n > 1 . We want to prove that p(_n + 1) must be true. That is: 

qo q\, q\^q2, ■■■ , qn-l ^ qn , qn^qn+l, ?0 => ?n+l 
Here is a direct proof of p (n + 1): 

Steps Proposition(s) Justification 

(1) 1) qo ^ qi, qi^qi, ■■■ , qn-i -^qn, qo Premises 

(n + 2) q„ (l)-(n + l),p(n) 

(n + 3) q„ -> q„+i Premise 

(n + 4) q„+] (n + 2), (n + 3), detachment ■ 

Example 3.7.3. For all n > l,n^ + 2n is a multiple of 3. An inductive proof follows: 
Basis: l-* + 2 (1) = 3 is a multiple of 3. 
The basis is almost always this easy! 

Induction: Assume that n > 1 and + 2 n is a multiple of 3. Consider (n + 1)^ + 2 (n + 1). Is it a multiple of 3? 

(n + 1)3 + 2(n + 1) = (n^ + 3 + 3 n + 1) + (2n + 2) 

= + 2n + 3 11^ + i n + 3 Rearrange the terms 

= (n^ + 2n) + 3in^ + n + 1) 

Yes, (n + 1)' + 2 (n + 1) is the sum of two multiples of 3; therefore, it is also a multiple of 3. ■ 

Variations of Induction 

Now we win discuss some of the variations of the principle of mathematical induction. The first simply allows for universes that are similar to 
P, like {-2, -1, 0, 1, ...)or(5, 6, 7, 8, ...). 



Principle of Mathematical Induction (Generalized). If p (n) is a proposition over {Icq , ko + 1, ko + ■■■], where is any integer, then 

p (n) is a tautology if 

(1) p{ko) is true, and 

(2) for all n > ko, pin) => p(n + 1). 

Example 3.7 A. In Chapter 2, we stated that the number of different permutations of k elements taken from an n element set, P (n; k), can 
be computed with the formula — We can prove this statement by induction on n. For n > 0, let ^ (n) be the proposition 

(n-fc)! 

Pin; k) = — — for all A: from to n. 

(n-K)\ 

Basis: ^(0) states that 

P(0; 0) = the number of ways that elements can be 

selected from the empty set and arranged in order 
= 0!/0! = 1. 

This is true — a general law in combinatorics is that there is exacfly one way of doing nothing. 

Induction: Assume that q in) is true for some natural number n. It is left for us to prove that this assumption impUes that qin + 1) is true. 
Suppose that we have a set of cardinality n + I and want to select and arrange k of its elements. There are two cases to consider, the first of 
which is easy. If k = 0, then there is one way of selecting zero elements from the set; hence 

Pin + 1; 0) = 1 = 



(n+l+0)! 

and the formula works in this case. 

The more challenging case is to verify the formula when k is positive and less than or equal to n + 1 . Here we count the value of P(n + 1; k) 
by counting the number of ways that the first element in the arrangement can be filled and then counting the number of ways that the 
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remaining k - I elements can be filled in using the induction hypothesis. 

There are n + 1 possible choices for the first element. Since that leaves n elements to fill in the remaining k - 1 positions, there are 
P(n; k - 1) ways of completing the arrangement. By the rule of products, 

P{n +l;k) = in+ \)P{n;k - 1) 
= {n+\) — 

_ (n+l)ii! 
~ (n-/t+l)! 
_ («+!)! ^ 

A second variation allows for the expansion of the induction hypothesis. The course-of-values principle includes the previous generalization. It 
is also sometimes called strong induction. 

The Course-of- Values Principle of Mathematical Induction. If p(n) is a proposition over [ko , ko + \,kQ + 2, ...], where k^ is any 
integer, then p (n) is a tautology if 

(1) p(^o) is true, and 

(2) forall«>^o, p(ko), p(ko + 1), . . . , p (n) ^ p (n + 1). 

Example 3.7.5. A prime number is defined as a positive integer that has exactly two positive divisors, 1 and itself. There are an infinite 
number of primes. The list of primes starts with 2,3,5,7,11,... . The proposition over {2,3,4,.. .) that we will prove here is p(n): n can 
be written as the product of one or more primes. In most texts, the assertion that p (n) is a tautology would appear as: 

Theorem. Every positive integer greater than or equal to 2 has a prime decomposition . 

If you were to encounter this theorem outside the context of a discussion of mathematical induction, it might not be obvious that the proof can 
be done by induction. Recognizing when an induction proof is appropriate is mostly a matter of experience. Now on to the proof! 

Basis: Since 2 is a prime, it is already decomposed into primes (one of them). 

Induction: Suppose that for some ^ > 2 all of the integers 2, 3, . . . , k have a prime decomposition. Notice the course-of-value hypothesis. 
Consider k + \. Either ^ + 1 is prime or it isn't. If /; + 1 is prime, it is already decomposed into primes. If not, then ^ + 1 has a divisor, d, 
other than 1 and ^ + 1. Hence, k + \ = c d where both c and d are between 2 and k. By the induction hypothesis, c and d have prime 
decompositions , c i ct ■ ■ ■ c,„ and did2 ■■■ dm , respectively . Therefore ,k + l has the prime decomposition c i C2 ■ ■ ■ c,„ did2 ■■■ d,,, ■ ■ 

HISTORICAL NOTE 

Mathematical induction originated in the late nineteenth century. Two mathematicians who were prominent in its development were Richard 
Dedekind and Giuseppe Peano. Dedekind developed a set of axioms that describe the positive integers. Peano refined these axioms and gave a 
logical interpretation to them. The axioms are usually called the Peano Postulates. 

Peano's Postulates. The system of positive integers consists of a nonempty set, P; a least element of P, denoted 1; and a "successor function," 
s, with the properties 

(1) If k e P , then there is an element of P called the successor of k, denoted s (k). 

(2) No two elements of P have the same successor. 

(3) No element of IP has 1 as its successor. 

(4) If S c P, 1 e S, and k e S => i (k) e 5, then 5 = P. 




Richard Dedekind Giuseppe Peano 



Notes: 

(a) You might recognize s (k) as simply being k + I. 

(b) Axiom 4, mentioned above, is the one that makes mathematical induction possible. In an induction proof, we simply apply that axiom to the 
truth set of a proposition. 
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Exercises for Section 3.7 
A Exercises 

1 . Prove that the sum of the first n odd integers equals . 

2. Prove that if n > 1, then 1(1!) + 2(2!) + •••+ n(n!) = (n + 1)! - 1. 

3. Prove thatforn > 1: ^J^= 7n(n + l)(2n + 1). 

k=i * 

4. Prove that for n > 1: 2 2* = 2"+' - 1. 

k=0 

5 . Use mathematical induction to show that forn > 1 , 

1 1 1 _ _n_ 

1x2 2x3 n(n+l) n+l 

6. Prove that if n > 2, the generalized DeMorgan's Law is true: 

-■(Pl ApiA- APn)<^(-'Pl) V (-■P2) V ••• V(-'Pn) 

B Exercises 

7. The number of strings of n zeros and ones that contain an even number of ones is 2""^ . Prove this fact by induction forn > 1 . 

8. Let p (n) be 8" - 3" is a multiple of 5. Prove that p (n) is a tautology over fsl. 

9. Suppose that there are n people in a room, n > 1, and that they all shake hands with one another. Prove that "'"^ handshakes will have 
occurred. 

10. Prove that it is possible to make up any postage of eight cents or more using only three- and five-cent stamps. 

C Exercises 

11. Generalized associativity. It is well known that if oi, a2, and 03 are numbers, then no matter what order the sums in the expression 
fli + 02 + 03 are taken in, the result is always the same. Call this fact p(3) and assume it is true. Prove using course-of-values induction that if 
ai, 02, .... and a„ are numbers, then no matter what order the sums in the expression ai + 02 + ••• + a„ are taken in, the result is always the 
same. 

12. Let S be the set of all numbers that can be produced by applying any of the rules below in any order a finite number of times. 

Rule 1: - e 5 

2 

Rule 2: 1 e S 

Rule 3: If a and b have been produced by the rules, then ab & S. 

Rule 4: If a and b have been produced by the rules, then ^ e S. 

Prove by course-of-values induction that ae5=>0<a< 1. Hint: The number of times the rules are applied should be the integer that you do 
the induction on. 

13. A recursive definition is similar to an inductive proof. It consists of a basis, usually the simple part of the definition, and the recursion, 
which defines complex objects in terms of simpler ones. For example, if is a real number and n is a positive integer, we can define x" as 
follows: 

Basis: x' = x . 

Recursion: if n > 2,x" = x"'^ x . 

For example, x^ = x^ x = (x' x)x = (xx) x. Proofs involving objects that are defined recursively are often inductive. Prove that if n, m e P, 

A-'"+" = x" x". Hint: Let p (m) be the proposition that x"'+" = x™ x" for all n > 1 . There is much more on recursion in Chapter 8. 

14. Let 5 be a finite set and let P„„ be defined recursively by Pi =5 and P„ = 5xP„_i for n > 2. 

(a) List the elements of P3 for the case S = {a, b}. 

(b) Determine the formula for | /*„ |, given that 1 5 | =k, and prove your formula by induction. 



Applied Discrete Structures by Alan Doerr & Keimeth Levasseur is licensed under a Creative Commons Attribution-Nonconmiercial-ShareAlike 3.0 United States License. 



71 

Chapter 3 - Logic 



I 3.8 Quantifiers 



As we saw in Section 3.6, if p (n) is a proposition over a universe U, its tinth set Tp is equal to a subset of U. In many cases, such as when p (n) 
is an equation, we are most concerned with whether Tp is empty or not. In other cases, we might be interested in whether Tp = U; that is, 
whether p (n) is a tautology. Since the conditions Tp i=0 and Tp = U are so often an issue, we have a special system of notation for them. 

THE EXISTENTIAL QUANTIFIER 

If p(n) is a proposition over U with Tp t 0, we commonly say "There exists an n in (/ such that p(n) (is true)." We abbreviate this with the 
symbols (3 n)^ (p(n)). The symbol 3 is called the existential quantifier. If the context is clear, the mention of U is dropped: (3 n) (p(n)). 
Example 3.8.1. 

(a) (3 k)^ {k^ - k - 12 = Cij is another way of saying that there is an integer that solves the equation k2-k - 12 = 0. The fact that two such 

integers exist doesn't affect the truth of this proposition in any way. 

(b) (3k)z (3^ = 102) simply states that 102 is a multiple of 3, which is true. On the other hand, (3 ^)2 (ik= 100) states that 100 is a 
multiple of 3, which is false. 

(c) (3 x)r (x^ + 1 = 0) is false since the solution set of the equation x^ + 1 = in the real numbers is empty. It is common to write 
{$x)^(x^ + 1=0) in this case. 

There are a wide variety of ways that you can write a proposition with an existential quantifier. Table 3.8.1 contains a list of different variations 
that could be used for both the existential and universal quantifiers. 

THE UNIVERSAL QUANTIFIER 

If p (n) is a proposition over U with Tp = U, we commonly say "For all n in U, pin) (is ttue)." We abbreviate this with the symbols 
(V n)^ (p(n)) . The symbol V is called the universal quantifier. If the context is clear, the mention of U is dropped: (V n) (p(n)) . 
Example 3.8.2. 

(a) We can say that the square of every real number is non-negative symbolically with a universal quantifier: (V x) r (x ^ > 0). 

(b) (V n) z (n + = + n = n) says that the sum of zero and any integer n is n. This fact is called the identity property of zero for addition. 

Table 3.8.1 Notational Variations for Existential and Universal Quantifiers 

Universal Quantifier Existential Quantifier 

(yn)u(j>m (,3n)u(.pm 

(ynelDipm (3neJ/)(p(n)) 

V ne U, pin) 3ne U such that pin) 

pin), n G U p (n) is ttue for some n e U 
p in) is ttue for all n e i/ 



THE NEGATION OF QUANTIFIED PROPOSITIONS 

When you negate a quantified proposition, the existential and universal quantifiers complement one another. 

Example 3.8.3. Over the universe of animals, define F(x) : x is a fish and W(x) : x lives in the water. We know that the proposition W(x) F(x) 
is not always ttue. In other words, (Vx)(W(x) F(x)) is false. Another way of stating this fact is that there exists an animal that lives in the 
water and is not a fish; that is, 

-.(Vx)(W(x) ^ Fix)) « (3 x)i^iWix) ^ Fix))). 

<^ (3 x)iWix) A ^Fix)) 

Note that the negation of a universally quantified proposition is an existentiaUy quantified proposition. In addition, when you negate an 
existentially quantified proposition, you obtain a universally quantified proposition. Symbolically, 

- ((V n)u ipin)) ) « (3 «) y (- pin))), and 
-((3«)t;(p(«)))« (V n)ui^pin))) 
Example 3.8.4. 
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(a) The ancient Greeks first discovered that ^f2 is an irrational number; that is, V2" is not a rational number. -1 ((3 r)Q (r^ = 2)) and 
(V r)Q (r^ + 2) both state this fact symbolically. 

(b) ((V n)p (n ^ - n + 41 is prime)) is equivalent to (3 n)p (n^ - n + 41 is composite). They are both either true or false. 
MULTIPLE QUANTIFIERS 

If a proposition has more than one variable, then you can quantify it more than once. For example, if p (x, y) : - = (x + y) (x - y) is a 
tautology over the set of all pairs of real numbers because it is true for each pair (x, j) in IR x R. Another way to look at this proposition is as a 
proposition with two variables. The assertion that p{x,y) is a tautology could be quantified as (V x)r ((V y) ^ (p (x, y))) or 
(Vy)R((V;c)R(p(;c, y))) 

In general, multiple universal quantifiers can be arranged in any order without logically changing the meaning of the resulting proposition. The 
same is true for multiple existential quantifiers. For example, p(x, y) : x + y = 4andx - y - 2 is a proposition over IR x IR. 
(3 x\ ((3 y) ^ (x + y = 4 and x - y = 2)) and (3 y\ ((3 x) r (x + y = 4 and x - y = 2)) are equivalent. A proposition with multiple 
existential quantifiers such as this one says that there are simultaneous values for the quantified variables that make the proposition true. A 
similar example is ^ (x, y) : 2x - y - 2 and 4 x - 2y = 5, which is always false; and the following are all equivalent 

-((3 x)r((3 y)^{q{x, y)))) « - (3 y)^ ((3 x)r (<?(x, y)))) 

«(Vy)R (- ((3 x)^(q(x,y))) 
«((Vy)R((Vx)R(-9(x, y)))) 

«((Vx)R((Vy)R(-<i-(x, y)))) 

When existential and universal quantifiers are mixed, the order cannot be exchanged without possibly changing the meaning of the proposition. 
For example, let [R+ be the positive real numbers, x : (V a)^* ((3 b)^* (ab = 1)) and y : (3 b)^+ ((V a)^+ (ab = 1)) have different meanings; x 
is true, while y is false. 

TIPS ON READING MULTIPLY QUANTIFIED PROPOSITIONS 

It is understandable that you would find propositions such as x difficult to read. The trick to deciphering these expressions is to "peel" one 
quantifier off the proposition just as you would peel off the layers of an onion (but quantifiers shouldn't make you cry). Since the outermost 
quantifier in x is universal, x says that z (a) : (3 h)^^ (ab = 1) is true for each value that a can take on. Now take the time to select a value for 
a, like 6. For the value that we selected, we get z(6) ; (3 b)^* (6h = 1), which is obviously true since 6b = I has a solution in the positive real 
numbers. We will get that same truth value no matter which positive real number we choose for a; therefore, z (a) is a tautology over and we 
are justified in saying that x is true. The key to understanding propositions like x on your own is to experiment with actual values for the 
outermost variables as we did above. 

Now consider y. To see that y is false, we peel off the outer quantifier. Since it is an existential quantifier, all that y says is that some positive 

real number makes w(b) : a) (ab = 1) true. Choose a few values of b to see if you can find one that makes iv (b) true. For example, if we 
pick b = 2, we get (V a) (2 a = 1), which is false, since 2 a is almost always different from 1 . You should be able to convince yourself that 
no value of b will make vv (b) true. Therefore, y is false. 

Another way of convincing yourself that y is false is to convince yourself that -> y is true: 

- ((3 b)^. ((V a)R. (ab = m « (V b)^. - ((V a)^. (ab = 1)) 

«(V %.((3a)R.(afo # D) 

In words , for each value of , a value for a that makes ab I. One such value is a = 7 + 1 . Therefore , y is true . 

b 

EXERCISES FOR SECTION 3.8 
A Exercises 

1 . Let C (x) be "x is cold-blooded," let F (x) be "x is a fish," and let S (x) be "x Uves in the sea." 

(a) Translate into a formula: Every fish is cold-blooded. 

(b) Translate into EngUsh: (3 x) (S (x) A W) 
and(Vx)(F(x) ^ S(x)). 

2. Let M (x) be "x is a mammal," let A (x) be "x is an animal," and let W (x) be "x is warm-blooded." 

(a) Translate into a formula: Every mammal is warm-blooded. 

(b) Translate into English: (3 x) (A (x) f\ (^ M (x))). 

3. Over the universe of books, define the propositions B(x): x has a blue cover, M(x): x is a mathematics book, C(x): x is published in the 
United States, and if (x, y) : The bibUography of x includes y. Translate into words: 

(a) (3x)(-B(x)). 
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(b) (V;c)(MW AU{x)^ B{x)). 

(c) (,3x)(,M(,x) A -BW). 

(d) (,3y)(,(yx)mx)^R(.x,y))). 
Express using quantifiers: 

(e) Every book with a blue cover is a mathematics book. 

(f) There are mathematics books that are published outside the United States. 

(g) Not all books have bibliographies. 

revised 

4. Let the universe of discourse, U, be the set of all people, and let M{x, y) be "x is the mother of y." 

(a) Which of the following is a true statement? Translate it into English. 

(i) (E x)uiiVy%iMix,y))) 

(ii) C^yhiO x)u(M(x,y))) 

(b) Translate the following statement into logical notation using quantifiers and the proposition M(x, y) over U: "Everyone has a grandmother," 

5. Translate into your own words and indicate whether it is true or false that (3 «) ^ (4 «- - 9 = 0). 

6. Use quantifiers to say that Vs" is an irrational number. 

7. What do the following propositions say, where Uis the power set of {1, 2, ... , 9}? Which of these propositions are true? 

(a) (VA)f;(|A| # \A'\). 

(b) (3A)y(3fi)£;(|A| =5, =5, andAnB = 0) 

(c) (VA)uiyB)u{A-B = B'-A') 

8. Use quantifiers to state that for every positive integer, there is a larger positive integer. 

9. Use quantifiers to state that the sum of any two rational numbers is rational. 

10. Over the universe of real numbers, use quantifiers to say that the equation a + x = b has a solution for all values of a and b. Hint: You will 
need three quantifiers. 

1 1 . Let n be a positive integer. Describe using quantifiers: 

n 

(a) X e U 

*=! 

(b) X e ClAk 

k=l 

12. Provethat(3x)(V)')(p(x, y)) => {V y){3 x)(p{x, y)), but the converse is not true. 
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I 3.9 A Review of Methods of Proof 

One of the major goals of this chapter is to acquaint the reader with the key concepts in the nature of proof in logic, which of course carries over 
into all areas of mathematics and its applications. In this section we will stop, reflect, and "smell the roses," so that these key ideas are not lost 
in the many concepts covered in logic. In Chapter 4 we will use set theory as a vehicle for further practice and insights into methods of proof. 

KEY CONCEPTS IN PROOF 

1 . AU theorems in mathematics can be expressed in "If P then C" (P C) format, or in "Ci if and only if C2" (P C) format. The latter is 

equivalent to "If C] then C2, and if C2 then C] ." Alternate ways of expressing conditional propositions are found in Section 3.1. 

2. In "If P then C," P is the premise (or hypothesis) and C is the conclusion. It is important to realize that a theorem makes a statement that is 
dependent on the premise being true. 

3. There are two basic methods for proving P ^ C: 

(a) Direct: Assume P is true and prove C is true; and 

(b) Indirect (proof by contradiction): Assume P is true and C is false and prove that this leads to a contradiction of some premise, theorem, 
or basic concept. 

4. The method of proof for "If and only if" (iff) theorems is found in the law (P C) « ((P ^ C) A (C -> P)). Hence to prove an "If and 
only if statement one must prove an "if . . . then ..." statement and its converse. 

The initial response of most people when confronted with the task of being told they must be able to read and do proofs is: 

(a) Why? or, 

(b) I caimot do proofs. 

To answer the first question, problem solving, even on the most trivial level, involves being able to read statements. First we must understand 
the problem and know the hypothesis; second, we must realize when we are done and we must understand the conclusion. To apply theorems or 
algorithms we must be able to read theorems and their proofs intelligently. 

To be able to do the actual proofs of theorems we are forced to learn: 

(1) the actual meaning of the theorems, and 

(2) the basic definitions and concepts of the topic discussed. 

For example, when we discuss rational numbers and refer to a number x as being rational, this means we can substitute a fraction - in place of 

X, with the understanding that p and q are integers and q i^O. Therefore, to prove a theorem about rational numbers it is absolutely necessary 
that you know what a rational number "looks like." 

It's easy to comment on the response, "I caimot do proofs." Have you tried? As elementary school students we were in awe of anyone who 
could handle algebraic expressions, especially complicated ones. We learned by trying and applying ourselves. Maybe we carmot solve all 
problems in algebra or calculus, but we are comfortable enough with these subjects to know that we can solve many and can express ourselves 
intelligently in these areas. The same remarks hold true for proofs. 

THE ART OF PROVING P ^ C 

First one must completely realize what is given, the hypothesis. The importance of this is usually overlooked by beginners. It makes sense, 
whenever you begin any task, to spend considerable time thinking about the tools at your disposal. Write down the premise in precise language. 
Similarly, you have to know when the task is finished. Write down the conclusion in precise language. Then you usually start with P and 
attempt to show that C follows logically. How do you begin? Basically you attack the proof the same way you solve a complicated equation in 
elementary algebra. You may not know exactly what each and every step is but you must try something. If we are lucky, C follows naturally; if 
it doesn't, try something else. Often what is helpful is to work backward fi-om C. Finally, we have all learned, possibly the hard way, that 
mathematics is a participating sport, not a spectator sport. One learns proofs by doing them, not by watching others do them. We give several 
illustrations of how to set up the proofs of several examples. Our aim here is not to prove the statements given, but to concentrate on the logical 
procedure. 

Example 3.9.1. We will outline a proof that the sum of any two odd integers is even. Our first step will be to write the theorem in the 
familiar conditional form; If / and k are odd integers, then j + k is even. The premise and conclusion of this theorem should be clear now. 
Notice that if j and k are not both odd, then the conclusion may or may not be true. Our only objective is to show that the truth of the premise 
forces the conclusion to be true. Therefore, we can express the integers j and A: in the form that all integers take; that is: 

11 e Z is odd implies (3 m e Z) (n = 2m + 1). 
This observation allows us to examine the sum y + k and to verify that it must be even. 

Example 3.9.2. Let n e Z. We wMl outline a proof that is even if and only if n is even. 
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Outline of a proof: Since this is an "If and only if theorem we must prove two facts (see key concept number 4 above): 

I. (=>) If rp- is even, then n is even. To do this directly, assume that rp- is even and prove that n is even. To do this indirectly, assume is 
even and that n is odd, and reach a contradiction. It turns out that the latter of the two approaches is easiest here. 

n. («=) If n is even, then is even. To do this directly, assume that n is even and prove that is even. 

Now that we have broken the theorem down into two parts and know what to prove, we proceed to prove the two implications. The final 
ingredient that we need is a convenient way of describing even integers. When we refer to an integer n (or m, or k,. . . ) as even, we can 
always replace it with a product of the form 2 q, where q is an integer (more precisely, (3 q] ^ (n = 2 q)). In other words, for an integer to be 
even it must have a factor of two in its prime decomposition. 

Example 3.93. Our final example wiU be an outline of the proof that the square root of 2 is irrational (not an element of Q). This is an 
example of the theorem that does not appear to be in the standard P C form. One way to rephrase the theorem is: If is a rational number, 
then i=2. A direct proof of this theorem would require that we verify that the square of every rational number is not equal to 2. There is no 

convenient way of doing this, so we must turn to the indirect method of proof. In such a proof, we assume that x is a rational number and that 

x^ = 2 (i.e., V2~ is a rational number). This will lead to a contradiction. In order to reach this contradiction, we need to use the following facts: 

(a) A rational number is a quotient of two integers. 

(b) Every fraction can be reduced to lowest terms, so that the numerator and denominator have no common factor greater than 1 . 

(c) If n is an integer, is even if and only if n is even. 

EXERCISES FOR SECTION 3.9 
B Exercises 

1 . Prove that the sum of two odd positive integers is even. 

2. Write out a complete proof that if n is an integer, nP- is even if and only if n is even. 

3. Write out a complete proof that ^f2 is irrational. 

4. Prove that is an irrational number. 

5. Prove that if x and y are real numbers such that x + y < 1 , then either x< ^ory < i . 

6. Use the following definition of absolute value to prove the given statements: If x is a real number, then the absolute value of x, \x |, is 
defined by: 

I \ _ j X if X > 
I I 1 -jc if jc < 

(a) For any real number X, \ x\ > 0. Moreover, | x | = implies x = 0. 

(b) For any two real numbers X and J, |x| • \y\ = \ xy\. 

(c) For any two real numbers X and |x + >'|< |x| + |>'|. 
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I SUPPLEMENTARY EXERCISES FOR CHAPTER 3 

Section 3.1 

1. Construct the truth tables of 

(a) p V P 

(b) P A (- P) 

(c) P V (- P) 

(d) p hp 

2. Express each of the following in symbolic form and determine whether they are true or false: 

(a) If a, b e Z, and if a = or & = 0, then a ■ b = 0. 

(b) Ifa, & e Z.andifa - 6 = O.thena = Oorfc = 0. 

(c) Let a, b & J., a ■ b = OH and only if a = or = 0. 

(d) If {5}c Z.then 2 + 3 = 8. 

(e) If 2 + 3 = 8, then the world is flat. 

(f) 5 is an odd integer if and only if 8 is an even integer. 

Section 3.2 

3. Write the truth table for the expression p \J q i\ ^ p. 

4. Insert parentheses in the following statements to indicate the order in which the operations are performed: 

(a.) p \J q hr\J 

(b) pA^q\/^pAq 

(c) py q Ar 

(d) pAqMpAr yqAr 

Section 3.3 

5. Use truth tables to verify that {{p ^ -i ^) A V A (""")) => P is a tautology. 

6. Is an implication equivalent to its converse? Verify your answer using a truth table. 

7. Prove that an implication is always equivalent to its contrapositive. 

8. (a) Construct truth tables for the following propositions generated by p, q, and r. 

(i) r A(P Aq) (ii) r \J (p\l q) (iii) r A q 
(b) Which of the propositions i, ii, and iii in part (a) imply proposition i? Explain. 

9. Suppose that is a proposition generated by p and q, and x is equivalent to p ^ p A q- What is the truth table for x^ 

10. The Scheffer Stroke is the logical operator defined by the following truth table: 

P q Piq 
1 

1 1 

1 1 
1 1 

Truth Table for the Sheffer Stroke 

(a) Prove that p \ qis equivalent to -> (p A q)- 

The significance of the Sheffer Stroke is that it is a "universal" operator. AH other operators can be built from it. 

(b) Prove that p ^ p \ p. 

(c) Build A using only the Sheffer Stroke. 
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(d) Build V using only the Sheffer Stroke. 

Section 3.4 

1 1 . Write the negation of each of the following statements: 

(a) 3 is a prime number and it is even. 

(b) 4 is a prime number or it is odd. 

(c) If I can exhibit an example of a statement then I have proven it true. 

(d) Ifx^ - llx + 12 = O.thenx = 3orx = 8. 

12. Prove that all a defined by has the same property as the Sheffere Stroke (see problem 10) in that is is a "universal" operator. 



p 


1 


pf\1 








1 





1 


1 


1 





1 


1 


1 






Truth Table for pAq 

13. The following are frequently used and very important tautologies in logic. Use truth tables to prove them. 

(a) (p ^ ^) <=>(-. ^ ^ -. p) 

(b) ip^q)^(<ip^q)Aiq^p)) 

14. Write the following in symboUc notation and prove it is a tautology: "The statement p if and only if ^ is equivalent to saying if p then q 
and if-then p." 

15. Write the following in symbolic notation and determine whether it is a vaUd argument: "If 1 quit my job, then 1 wiU starve. If 1 don't do my 

work, then 1 must quit my job. 1 did my work, therefore 1 wiU eat." 

16. Write the dual of each of the following statements: 

(a) p V 0) ^ 1 

(b) (pV?)A yr)^{ryq) 

Section 3.5 

17. Write the following in symbolic form and then determine its validity. "If this car is made in England then parts are difficult to obtain. This 

car is expensive, or it is not difficult to obtain parts. But this car is not expensive. Hence it was not made in England." 

18. In order to attach the Mark 13 printer to the Lemon 111 computer, you must set eight "dip switches" in the computer according to the 
following rules. The switches are labeled a through h and are set to be either ON or OFF. 

(1) Neither a nor c is set the same as d. 

(2) b and g are different if and only if e and g are in the same positions. 

(3) g is OFF if d is OFF, but g is ON if bis OFF. 

(4) d is ON, unless e is the same as/. 

(5) h is not the same as a if either b or e is OFF. 

(6) g is OFF only if e is not the same as h. 

(7) b,f, and g are not all the same. 
How should the switches be set? 

19. Consider the following argument: 

If person X does not live in France, then X does not speak French. 

X does not drive a Chevrolet. 

If X Uves in France, then he rides a bicycle. 

Either X speaks French or he drives a Chevrolet. 
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Therefore, X rides a bicycle. 
Let p = "X lives in France." 
q = "X speaks French." 
r = "X drives a Chevrolet." 
s = "X rides a bicycle." 

Translate the argument into logical notation using these propositions, and prove it by any method except a truth table. 

20. Determine the validity of the following argument; "1 will miss class only if 1 sleep late. 1 will not sleep late. Therefore, 1 will not miss class." 

21 . Mayoral candidate Ms. Perpetual Candidate made the following promise to the voters: "If 1 am elected, 1 will bring industry to the town. If 1 
bring industry to the town, your taxes will decrease. If your taxes decrease, you will be wealthier. Therefore, if 1 am elected, you wiU be 
wealthier." Express this argument in symbolic notation and determine whether the mayoral candidate is telling the truth. 

22. Professor Smoothtalker made the following promise to his class. "If you receive an A in this coiu-se, you are happy. You wiU do all your 
assignments or you are not happy. If you concentrate too hard, you wiU not do all your assignments. Therefore, if you are happy, do not 
concentrate too hard." Is Professor Smoothtalker' s argument valid? 

23. Determine whether the following argument is valid: Taxes wiU increase or government spending decreases. Government spending increases 
or more people have jobs. More people do not have jobs or people are rich. Therefore, if taxes decrease, people are rich. 

Section 3.6 

24. Let p (n) hen < 2 and let q (n) be < 5 . 

(a) Over the universe of integers, Z, are p and q equivalent? Does one imply the other? 

(b) Over the universe of natural numbers, 1^, are and q equivalent? Does one imply the other? 

25. Prove that: T^^, = fl T,. 

26. Prove that: Tp^, = T/ U T^. 

Section 3.7 

27. Express 60 and 120 as a product of primes. 

28. Prove that forn > 1 

i=l 

29. (a) Prove that ( ^ " j ) + ( ^ ) = ( " ^ ' ) /t > 1 and n >k + 1. 
(b) Use mathematical induction to prove the binomial theorem: 

ix + yr= zil for n > 0. 

30. Use mathematical induction to prove for all n > 1 and for all real numbers c, a;, and fc,, = 1, 2, ...,n: 

(a) iiai + bi)=Y.ai+ibi 

(=1 i=l (=1 

n n 

(b) 2 cui = c Y,ai 

i=l i=l 

Section 3.8 

31 . Write the negation of: "Some sailing is dangerous and all fishing is tedious" in graceful English. 

32. Prove: - ((E u) (V v) (p)) « (V u) ((3 v) p)) 

33. Translate the following sentences into expressions using quantifiers: 

(a) All fish except sharks are kind to children. 

(b) Either every wine-drinker is very communicative, or some pawn broker is honest and doesn't drink wine. 

(c) If all clever philosophers are cynics, and only women are clever philosophers, then, if there are any clever philosophers, some 
women are cynics. 
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34. First write each of the following in logical notation using quantifiers, then write the negative of each logical expression using symbols and 
complete English sentences. If possible, determine which of the statements are true. 

(a) All people in this classroom are over six feet tall. 

(b) Some of the people in this classroom are over six feet tall and are bald. 

(c) Let U = [2, 3, 4, 5} and let p (n) denote the statement "n is a prime number." Apply the above directions to the statement: Every 

number in U is a prime number. 

(d) All prime numbers are odd. 

(e) You will all pass the course Discrete Structures or you will all fail. 

(f) You can fool some of the people some of the time. 

35. Use quantifiers to state that for every positive integer, there is a larger positive integer. 

36. Over the universe of students in your class, let N (x, y) be "x knows y's name." Interpret in English: 

(a) (3x)((Vj)(yV(x, y))) 

(b) (yy)((3x)(N(x, >')andx + y)) 

(c) How would you symbolically say that everyone knows everyone's name? 

(d) How would you symbolically say that everyone knows your name? 

(e) How would you symbolically say that someone in the class has amnesia? 

Section 3.9 

37. For any nonzero real number x, 

x^ = 1 x=lorx = -l 

(a) Outline the logical procedure you would use to prove this statement. 

(b) Fill in the proof. 

38. Let a, b, c e P and read a\b as "a divides evenly into b." Consider the statements 

(i) b \ a and a \ b implies a = b. 

(ii) If p is prime and p\a^ , then p | a. 

(a) Are these statements true? Explain your answers. 

(b) Is the converse of each of these statements true? Explain your answers. 

(c) Is the contrapositive of each of these statements true? Explain your answers. 

39. Letfl, h e R. A necessary and sufficient condition for a-b = Oisthata = Ootb = 0. 

(a) Is this statement true? Explain your answer. 

(b) Outline the logical procedure you would use to prove this statement. 
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MORE ON SETS 



GOALS 

In this chapter we shall look more closely at some basic facts about sets. One question we could ask ourselves is: Can we manipulate sets 
similarly to the way we manipulated expressions in basic algebra, or to the way we manipulated propositions in logic? In basic algebra we are 
aware that a-{b + c) = a-b + a- c for all real numbers a, b, and c. In logic we verified an analogue of this statement, namely, 
P A ( I? V '■) <=> (p A <?) V (p A '■))> where p, q, andr were arbitrary propositions. If A, B, and C are arbitrary sets, is 
A f] {B U C) = {A f] B) U (A f] C)l How do we convince ourselves of its truth or falsity? Let us consider some approaches to this 
problem, look at their pros and cons, and determine their validity. Many of the ideas expressed are true, in general, in mathematics. Partitions of 
sets and min sets will be introduced. 

I 4.1 Methods of Proof for Sets 

There are a variety of ways that we could attempt to prove that the distributive law for intersection over union is true; that is, that for any three 
sets. A, B, and C, A f] (B [j C) = (A f] B) [j (A f] C). We start with a common "non-proof and then work toward more acceptable 
methods . 

EXAMPLES AND COUNTEREXAMPLES 

We could, for example, let A = |1, 2), B = |5, 8, 10|, and C = |3, 2, 5|, and determine whether the distributive law is true. Obviously, in 
doing this we will have only determined that the distributive law is true for this one example. It does not prove the distributive law for all 
possible sets A, B, and C and hence is an invalid method of proof. However, trying a few examples has considerable merit insofar as it makes us 
more comfortable with the statement in question, and indeed if the statement is not true for the example, we have disproved the statement. 

Definition: Counterexample . An example that disproves a statement is called a counterexample . 

Example 4.1.1. From basic algebra we learned that multiplication is distributive over addition. Is addition distributive over multiplica- 
tion; that is, is a + {b ■ c) = (a + b) ■ {a + c)l If we choose the values a = 3, b = A, and c = 1, we find that 
3 4- (4 • 1) (3 -I- 4)-(3 4- 1). Therefore, this set of values serves as a counterexample to a distributive law of addition over multiplication. 

PROOF USING VENN DIAGRAMS 

In this method, we illustrate both sides of the statement via a Venn diagram and determine whether both Venn diagrams give us the same 
"picture," For example, the left side of the distributive law is developed in Figure 4.1.1 and the right side in Figure 4.1.2. Note that the final 
results give you the same shaded area. 
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Figure 4.1 .1 Lett side ot distributive law developed 

The advantage of this method is that it is relatively quick and mechanical. The disadvantage is that it is workable only if there are a small 
number of sets under consideration. In addition, it doesn't work very well in a static environment like a book or test paper. Venn diagrams tend 
to work well if you have a potentially dynamic environment like a blackboard or video. 




Figure 4.1.2 Rigtit side of distributive law developed 



PROOF USING SET-MEMBERSHIP TABLES 

Let A be a subset of a universal set U and let ue U. To use this method we note that exactly one of the following is true: u e A or « ^ A. 
Denote the situation where w e A by 1 and that where « ^ A by 0. Working with two sets, A and B, and if m e U, there are four possible 
outcomes of "where u can be." What are they? The set-membership table for A IJ B is : 

A B AUB 


1 1 

1 1 
1 1 1 

This table illustrates that ue A IJ 5 if and only if a e A or ii e B. 

In order to prove the distributive law via a set-membership table, write out the table for each side of the set statement to be proved and note that 
if S and T are two columns in a table, then the set statement S is equal to the set statement T if and only if corresponding entries in each row are 
the same. 

To prove AHCB U C) = (AHB) U (A RC), first note that the statement involves three sets. A, B, and C, So there are 2^ = 8 possibilities for the 
membership of an element in the sets . 
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Since each entry in Column 7 is the same as the corresponding entry in Column 8, we have shown that A[j (B [j C) = (Af] B) [j (A f] C) 
for any sets A, B, and C. The main advantage of this method is that it is mechanical. The main disadvantage is that it is reasonable to use only 

for a relatively small number of sets. If we are trying to prove a statement involving five sets, there are 2^ = 32 rows, which would test anyone's 
patience doing the work by hand. 



PROOF USING DEFINITIONS 

This method involves using definitions and basic concepts to prove the given statement. This procedure forces one to learn, relearn, and 
understand basic definitions and concepts. It helps individuals to focus their attention on the main ideas of each topic and therefore is the most 
useful method of proof. One does not learn a topic by memorizing or occasionally glancing at core topics, but by using them in a variety of 
contexts. The word proof panics most people; however, everyone can become comfortable with proofs. Do not expect to prove every statement 
immediately. In fact, it is not our purpose to prove every theorem or fact encountered, only those that illustrate methods and/or basic concepts. 
Throughout the text we will focus in on main techniques of proofs. Let's illustrate by proving the distributive law. 

Proof Technique 1. State or restate the theorem so you understand what is given (the hypothesis) and what you are trying to prove (the 
conclusion). 

Theorem 4.1.1. If A, B, and C are sets, then A f] (B \J C) = (A f] B) \J (A f] Q 
Assume: A, B , and C are sets. 
Prove: A R (5 U C) = (A R S) U (A R C). 

Commentary: What am I trying to prove? What types of objects am I working with: sets? real numbers? propositions? The answer is sets: sets 
of elements that can be anything you care to imagine. The universe from which we draw our elements plays no part in the proof of this theorem. 

We need to show that the two sets are equal. Let's call them the left-hand set (L.H.S.) and the right-hand set (R.H.S. ) To prove that 
LUS. = R.H.S., we must prove two things: (a) LH.S. c R.H.S. and (b) R.H.S. c L.H.S. 

To prove part a and, similarly, part b, we must show that each element of L.H.S. is an element of R.H.S. Once we have diagnosed the problem 
we are ready to begin. 

Proof of Theorem 4.1.1: We must prove: 

(a) A n (5 U C)c(AnB) U (A HQ. 

Let X e A n(5 U C) to show x e (A f] B) {J {A f] Q. 

X e A n (B U C) 
Definition of U > Pi => x e A and (x e B or x e C) 

Distributive Law of Logic => (x e A and x e 5) or (x e A andx e C) 
Definition of n => (x e A fl B) or(x e A fl C) 

Definition of U ^ x e (Af] B) \J (A f] C) 

and(b) (A n B) U (A n C) c A n (B U C) 

xe(AnB) u (A nc) 

Why? ^(xeAnB)or(xeAnC) 

Why ? => (x e A and x e B) or (x e A and x e C) 

Why? =>xeAand(xeSorxeC) 

Why? ^xeAn(BUC) ■ 

Proof Technique 2 . 

(1) To prove that A c B,we must show that ifx e A, then x E B. 

(2) To prove that A = B,we must show: 

(a) A c B, and 

(b) B cA. 

To further illustrate the Proof-by-Definition technique, let's prove the following: 
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Theorem 4.1.2. Let A,B, and C be sets, then 
Ax(B f] C) = (AxB) f] (Ax C). 

Commentary; We again ask ourselves: What are we trying to prove? What types of objects are we dealing with? We realize that we wish to 

prove two facts: (a) L.H.S. c R.H.S. and (b) R.H.S. c L.H.S. 

To prove part a, and similarly part b, we'll begin the same way. Let 

6 LJI.S. to show e RJI.S. What should be? 

What does a typical object in the LJI.S. look like? 
Proof of Theorem 4.1 .2: We must prove: 
(a)Ax(Bri QQ(AxB)r\(AxQ. 

h&tix, y) e Ax(B n C) to prove (;«:, y) e (AxB) fl (AxC). 
(x, y) eAx{B HQ. 



Why? => X e AsLTidy e (Bf] C). 

Why? jc e Aand (j e fi andj e C). 

Why? (x e Aandy e B)and(x e Aandy eC). 

Why? (x, y) e (AxB) and (x, y) e (AxC). 

Why? ^ (x, j) e (Ax B) n(AxC). 

and(b) (Ax B) fl (AxC) c Ax(B fl C). 

Let(jt:, >■) e (AxB) n (A x C) to prove (x, y) e A fl (BxC). 

(x, j) e (AxB)n(AxC). 

Why? (x, >•) e AxB and(x, j) e AxC. 

Why? (x e Aandy e B)and(x e Aandy e C). 

Why? X e Aand(y e B andj e C). 

Why? =i> X e A and j e (B fl Q- 

Why? =* (x, >>) e Ax(Bn Q ■ 



EXERCISES FOR SECTION 4.1 
A Exercises 

1 . Prove the following: 

(a) Let A, B, and C be sets. IfAcBandBcC, then A c C. 

(b) Let A and B be sets. Then A - B= AflS'^ • 

(c) LetA.B, andCbesets.If(AcBandAcOthenAcBriC. 

(d) Let A and B be sets . A c B If and only if B"^ c A'^ . 

(e) LetA.B, andCbesets.If AcB thenAxC cBxC. 

2. Write the converse of parts (a), (c), and (e) of Exercise 1 and prove or disprove them. 

3. Disprove the following, assuming A, B, and C are sets; 

(a) A - B = B - A. 

(b) AxB = BxA. 

(c) A n B = A n C implies B = C. 

4. Let A, B, and C be sets. Write the following in "if . . . then . . ." language and prove: 

(a) X e B is a sufficient condition for x e A (J 

(b) AnBnC = 0isa necessary condition for A H B = 0- 

(c) A \J B = Bisa necessary and sufficient condition for AcB. 

B Exercises 

5. Prove by induction that if A, Bi B2 , . . . , B„, are sets, n > 2, then 
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I 4.2 Laws of Set Theory 

The following basic set laws can be derived using either the Basic Definition or the Set-Membership approach and can be illustrated by Venn 
diagrams. 

Commutative Laws 

{1)A\JB = B\JA (V)Ar\B=Br\A 

Associative Laws 

(2) A U (B U C)= (AUB)UC (2')An(BnC) = (AnB)nC 
Distributive Laws 

(3) AfKB U C) = (AnB)U(An C) (3') A U (B H Q = (A U B ) fl (AU Q 

Identity Laws 

(4) A U = U A = A. (4') Ar\U = Ur\A = A 

Complement Laws 

(5) A U A<^ = i/ (5') A n A<^ = 

Idem potent Laws 

(6) A U A = A (6') A n A = A 

Null Laws 

(7) Ail U =U (7') A n = 

Absorption Laws 

(8) A U (A n B) = A. (8') A n (A U B) = A. 

DeMorgan's Laws 

(9) (A U By = A"^ n B^ . (9') (A fl By = A^ U B^ 

Involution Law 

(10) (A^y = A. 

It is quite clear that most of these laws resemble or, in fact, are analogues of laws in basic algebra and the algebra of propositions. 
PROOF USING PREVIOUSLY PROVEN THEOREMS 

Once a few basic laws or theorems have been established, we frequently use them to prove additional theorems. This method of proof is 
sometimes more efficient than that of Proof by Definition. To illustrate, let us prove the following: 

Theorem 42.1 . Let A and B be sets. Then (A fl B) U (A fl B' ) = A. 

Proof : (A n B) U (A n B" ) = A n (B U B'' ) Why? 

= Ai^U Why? 
= A Why? ■ 

PROOF USING THE INDIRECT METHOD/ CONTRADICTION 

The procedure one most frequently uses to prove a theorem in mathematics is the Direct Method, as illustrated in Theorems 4.1.1 and 4.1.2. 
Occasionally there are situations where this method is not applicable. Consider the following: 

Theorem 422. Ut A, B, Cbesets.IfAcBandBOC = 0,then A{\C = 0. 

Commentary: The usual and first approach would be to assume A c B and B p] C = is true and to attempt to prove A p] C = is true. To 
do this you would need to show that nothing is contained in the set A p] C. Think about how you would show that something doesn't exist. It 
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is very difficult to do directly. 

The Indirect Method is much easier: If we assume the conclusion is false and we obtain a contradiction— then the theorem must be true. This 
approach is on sound logical footing since it is exactly the same method of indirect proof that we discussed in Section 3.5. 

Proof of Theorem 422; Assume Ac B and BC\C = 0, and AC\C + 0.To prove that this cannot occur, let ;c e A H C. 

Why? => X e A and x e C, 
Why? ^ X e B and X e C. 
Why? ^ X e 5 n C. 
But this contradicts the second premise. Hence, the theorem is proven. ■ 

EXERCISES FOR SECTION 4.2 

In the exercises that follow it is most important that you outline the logical procedures or methods you use. 

A Exercises 

1 . (a) Prove the associative law for intersection (Law 2') with a Venn diagram. 

(b) Prove DeMorgan's Law (Law 9) with a membership table. 

(c) Prove the Idempotent Law (Law 6) using basic definitions. 

2. (a) Prove the Absorption Law (Law 8') with a Venn diagram. 

(b) Prove the Identity Law (Law 4) with a membership table. 

(c) Prove the Involution Law (Law 10) using basic definitions. 

3 . Prove, using the set theory laws, as well as any other theorems proved so far; 

(a) A U (B - A) = A U B 

(b) A - B = B' - A . 

(c) AcB, AnC*0^BnC *0 

(d) A n (B - c) = (A n B) - (A n c). 

(e) A - (B U C) = (A - B) n (A - C) 

4. Use previously proven theorems to prove: 

(a) A n(Bnc)'^= (AnB<^)U(Anc<^) 

(b) A n(Bn(AnBr)= 

(c) (A n B) U B^ = A U B<^ 

(d) A U (B - C) = (A U B) - (C - A). 

5. Hierarchy of Set Operations. The rules that determine the order of evaluation in a set expression that involves more than one operation are 
similar to the rules for logic. In the absence of parentheses, complementations are done first, intersections second, and unions third. Parentheses 
are used to override this order. If the same operation appears two or more consecutive times, evaluate fi-om left to right. In what order are the 
following expressions performed? 

(a) A U B<^ n C. 

(b) ApB U CflB. 

(c) A U B U C<^ . 

C Exercise 

6. There are several ways that can be used to format the proofs in this chapter. One that should be familiar to you from Chapter 3 is illustrated 
with the following proof. Alternate proof of part (a) in Theorem 4.1.1: 

(1) X e A n (B U C) Premise 

(2) (x e A) A (J^ e B U Q (1), definition of intersection 

(3) (x e A) A((^ e B) V (^ e C)) (2), definition of union 
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(4) (;c e A) A e B) V e A) A e C) (3), distribute A over V 

(5) (x e Af]B)V(x e A C\C) (4), defmition of intersection 

(6) ;c e (A nS) U (A n C) (5), definition of union 
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I 4.3 Minsets 

Let Bi and B2 be subsets of a set A. Notice that the Venn diagram of Figure 4.3.1 is naturally partitioned into the subsets Ai, A2, A3, and A4. 
Further we observe that Ai , A2, A3, and A4 can be described in terms of Bi and B2 as follows: 

Ai=BiUB2'^ 
A2=Bi 
A3= Bi'^nB2 
A4= Bi'^nB2'^ 



Bi 




B2 




A 


Ai 


A2 




A3 


A4 



Figure 4.3.1 



Each A, is called a minset generated by B] and B2 . We note that each minset is formed by taking the intersection of two sets where each may 
be either B^. or its complement Bjf Note also, given two sets, there are 2^ =4 minsets. 
Minsets are occasionally called minterms. 

The reader should note that if we apply all possible combinations of the operations intersection, union, and complementation to the sets Bi and 
B2 of Figure 4.3.1, the smallest sets generated will be exactly the minsets, the minimum sets. Hence the derivation of the term minset. 

Next consider the Venn diagram containing three sets, Bi, B2, and B3. What are the minsets generated by Bi, B2, and B3? How many are 
there? Following the procedures outlined above, we note that 

B, n S2 n 
B, n fi^ n B3 
B, n n B'3 

are three of the 2' = 8 minsets. See Exercise 1 of this section. 

Definition: Minset. Let {Bj, Bj,... ,B„^ be a set of subsets of a set A. Sets of the form Di H fl ••• H 0„, where each Di, may be 
either Bi or B'' is called a minset generated by Bj, B2,... ,B„. 

Example 4.3.1. For another view, consider the following: Let A = {1, 2, 3, 4, 5, 6) with subsets Bi = {1, 3, 5) and B2 = {1, 2, 3). 
How can we, using set operations applied to B\ and B2 , produce a list of sets that contain elements of A efficiently without duplication? As a 
first attempt, we note that: 

Bi nB2 = {l,3}, 
Bf = {2, 4, 6} , and 
= {4, 5, 6}. 

We have produced aU elements of A but we have 4 and 6 repeated in two sets. In place of Bf and B^ , let us try Bf H B2 and B\ H ^2. 
respectively: 

Bf n B2 = {2} and 
Bi n = {5}. 

We have now produced the elements 1,2,3, and 5 using Bi p] B2 , Bf p] B2 and Bi H B2 yet we have not listed the elements 4 and 6. Most 
ways that we could combine Bi and B2 such as Bi U B2 or Bi U B2 will produce duplications of listed elements and will not produce both 4 
and 6. However we note that Bf 06^= {4, 6), exactly the elements we need. Bach element of A appears exactly once in one of the four 
minsets Bi p] B2 , Bf p] B2, Bi p] B^ and Bf P B^ . Hence, we have a partition of A. 

Theorem 43.1. Let Abe a set and let Bj, B2 ... , B„ be subsets of A. The set of nonempty minsets generated by Bj, B2 ... , Bn is a 

partition of A. 
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The proof of this theorem is left to the reader. The most significant fact about minsets is that any subset of A that can be obtained from -81,-82 
... ,B„, using the standard set operations can be obtained in a standard form by taking the union of selected minsets. 

Definition: Minset Normal Form. A set is said to be in minset normal form when it is expressed as the union of zero or more distinct 
nonempty minsets. 

Notes: 

(a) The union of zero sets is the empty set, 0. 

(b) Minset normal form is also called canonical form. 

Example 432. Let i/ = {-2, -1, 0, 1, 2},Bi = {0, 1, 2}, andB2 = {0, 2}. Then 
Bi n B2 = {0, 2} 

Bf n B2 = 

B, 85 = {1} 

B\ n 85 = {-2, -1) 

In this case, there are only three nonempty minsets, producing the partition ||0, 2), {1), {-2, -1)). An example of a set that could not be 
produced from just 81 and 82 is the set of even elements of U, (-2, 0, 2). This is because -2 and -1 cannot be separated — they are in the 
same minset and any union of minsets needs either include or exclude them both. In general, there are 2^ = 8 different minset normal forms 
because there are three nonempty minsets. This means that only 8 of the 2' = 32 subsets of U cdxi be generated from 81 and 82. 

EXERCISES FOR SECTION 4.3 
A Exercises 

1. Consider the subsets A = {1, 7, 8},B = {1, 6, 9, 10},andC = {1, 9, 10}, where i/ = {1,2, 10}. 

(a) List the nonempty minsets generated by A, B, and C. 

(b) How many elements of the power set of U can be generated by A, B, and C? Compare this number with | P(U) \. Give an example of one 
subset that cannot be generated by A, B, and C. 

2. (a) Partition {1,2, ....9} into the minsets generated by Bi = {5, 6,7}, B2 = {2, 4, 5, 9}, mdB^ = {3, 4, 5, 6, 8, 9}. 

(b) How many different subsets of {1, 2, 9} can you create using Bi, B2, and B3 with the standard set operations? 

(c) Do there exist subsets Ci , C2, C3 whose minsets wiU generate every subset of { 1 , 2, . . . , 9}? 

3. Partition the set of strings of O's and I's of length two or less, using the minsets generated by Bi = {s \ jhaslength2}, and 
82 = U I i starts with a 0}. 

4. Let Bi , B2, and B3 be subsets of a universal set U, 

(a) Find all minsets generated by Bi , B2, and B3. 

(b) Illustrate with a Venn diagram all minsets obtained in part (a). 

(c) Express the following sets in minset normal form: 8', , 81 H 82 ,81 U 82- 

5. (a) Partition A = {0, 1, 2, 3, 4, 5) with the minsets generated by Bi = {0, 2, 4} and B2 = (1, 5}. 
(b) How many different subsets of A can you generate from Bi andB2? 

B Exercises 

6. If {Bi, B2, B„}isapartitionof A, how many minsets are generated by Bi, B2, B„? 

7. Prove Theorem 4.3.1. 

C Exercise 

8. Let S be a finite set of n elements. Let 8; „ i = 1, 2, ... ,kbe nonempty subsets of S. There are 2^ minset normal forms generated by the k 
subsets. The number of subsets of S is 2". Since we can make 2^* > 2" by choosing k > logj n, it is clear that two distinct minset normal-form 
expressions do not always equal distinct subsets of S. Even for k < log2 n, it may happen that two distinct minset normal-form expressions 
equal the same subset of S. Determine necessary and sufficient conditions for distinct normal-form expressions to equal distinct subsets of S. 
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I 4.4 The Duality Principle 

In Section 4.2 we observed that each of the set laws labeled 1 through 9 had an analogue 1' through 9'. We notice that each of the laws in one 
column 2 can be obtained from the corresponding law in the other column by replacing U by Pi, (Pi by U. by U, U by 0, and leaving the 
complement unchanged. 

Definition: Duality Principle for Sets. Let S be any identity involving sets and the operations complement, intersection and union, . If S* 
is obtained from S by making the substitutions IJ ->f],(~]->lj,0->U, and U ^ 0, then the Statement S* is also true and it is called the 
dual of the Statement S. 

Example 4.4.1. The dual of (A f] B) [j (A f] B'' ) = A is (A U 5) fl (A U 5') = A 

One should not underestimate the importance of this concept. It gives us a whole second set of identities, theorems, and concepts. For example, 
we can consider the dual of minsets and minset normal form to obtain what is called maxsets and maxset normal form. 

EXERCISES FOR SECTION 4.4 
A Exercises 

1. State the dual of: 

(a) A U (B n A) = A. 

(b) A U m U A) n Bf = U, 

(c) (A U B'T f] B =A'r\B 

2. Consider Table 3.4.1 and then write a description of the principle of duality for logic. 

3. Write the dual of: 

(a) p V - ((- <? V P) A <?) « 1 

(b) (-(p A (- q)) V V q)- 

B Exercises 

4. Use the principle of duality and the definition of minset to write the definition of maxset. {Hint, just replaceflby U •) 

5. Let A = |1,2, 3,4, 5, 6) and let 6i = (1, 3, 5)andB2 = {1,2, 3). Find the maxsets generated by Bi and 62- Note the set of maxsets 
does not constitute a partition of A. Can you explain why? 

(a) Write out the definition of maxset normal form. 

(b) Repeat Problem 4 of Section 4 .3 for maxsets . 

6 . Is the dual of Exercise 5 of Section 4 . 1 true? 
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I SUPPLEMENTARY EXERCISES FOR CHAPTER 4 

Section 4.1 

1 . Let A and B be subsets of a universal set U. Use basic definitions to prove each statement. Be sure to outline carefully the logical structure 

of the proof. 

(a) AqS if and only if A CliU - B) = 0. 

(b) If i/ = A U BandA n B = 0, then A = U - B. 

(c) A and B are disjoint if and only if A c ?7 — B. 

2. Let A and B be subsets of a universal set U. Verify the statements using a Veim diagram. 

(a) AcBifandonlyif A C\B = A. 

(b) AcBifandonlyif A [J B = B. 

(c) A n B c A U B. 

3 . (a) Prove that if A, B, and C are sets, then (A U B) x C is a subset 

of(AxC) U (BxC). 

(b) Explain how you would proceed following part a if you wanted to prove that (A U B)xC = (AxC) U (BxC). 

4. Simplify the following: 

(a) (A n B'' n c) u B u (B n c) u (A'' n q. 

(b) (A u (B n c')) n ((A'' n b'') u c). 

Section 4.2 

5 . Prove with an indirect proof thai if A, B, and C are subsets of universe A is a subset of B, and B is a subset of C, then is a subset of A'^. 

6. Basic laws and theorems in different algebraic structures can be recalled easily by thinking in terms of analogous situations in elementary 
algebra. 

(a) Complete the following table: 





Algebra 


Algebra 


Elementary 




of Sets 


of Logic 


Algebra 


Objects Used 


Sets 




Real Numbers 


Basic Operations 


u 


A 










+ 

- (or multiplicative 








inverse 


Other Connectives 


c 


•» 





(b) Write analogous statements in the algebras of sets and logic. 
Which are true? 

(i) If X, J, and z are real numbers and Hx < y and y ^z, then x^z. 

(ii) -(-x) = X for any real number x. 

(iii) \ix ,y, and z are real numbers and x + y = x + z, then y = z ■ 

(iv) Letx, J e R.x = J if and only if x ^ j and y ^ x. 

(v) For X e R, x + = + x = x and x ■ 1 = 1 ■ x = x. 
7 . Prove or disprove: 

(a) Let A, B, and C be sets. If A U C # B U C.thenA # B. 
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(b) If A t B,thenA'^ tB'. 

8 . Let {Ai, A2, .. .A„} be a partition of set A, and let B be any nonempty subset of A, Prove that {A,- H B | A; H B * 0) is a partition of A f) B- 
Section 4.3 

9. Let (/ = |1, 2, 3, 4, 5, 6),B, = (1, SJ.andSz = (2, 3, 5}. 

(a) List the minsets generated by B\ and 62- 

(b) Show that the set of minsets form a partition of U. 
Section 4.4 

10. State the dual of each statement in Exercise 3, Section 4.2. 

1 1 . Show that the dual of each of the basic set laws 1 through 9 are the corresponding laws 1' through 9'. 
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INTRODUCTION TO MATRIX ALGEBRA 



GOALS 



The purpose of this chapter is to introduce you to matrix algebra, which has many applications. You are already familiar with several algebras: 
elementary algebra, the algebra of logic, the algebra of sets. We hope that as you studied the algebra of logic and the algebra of sets, you 
compared them with elementary algebra and noted that the basic laws of each are similar. We will see that matrix algebra is also similar. As in 
previous discussions, we begin by defining the objects in question and the basic operations. 



Definition: Matrix. A matrix is a rectangular array of elements of the form 



a,, 


a,2 


a, 3 ■ 


•• "in 


"2J 




"23 ■ 


■■ "2n 




"32 


"33 ■ 


■■ "3„ 




"ml 


'^m3 • 





A convenient way of describing a matrix in general is to designate each entry via its position in the array. That is, the entry 034 is the entry in 
the third row and fourth column of the matrix A. Depending on the situation, we will decide in advance to which set the entries in a matrix will 
belong. For example, we might assume that each entry Oy (1 < / < m, 1 < j < n) is a real number. In that case we would use M„,x„(IR) to 

stand for the set of all m by n matrices whose entries are real numbers. If we decide that the entries in a matrix must come from a set 5, we use 
M,„x„{S) to denote all such matrices. 

Definition: Order of a Matrix. The matrix A that has m rows and n columns is called an m xn (read "m by n") matrix, and is said to 
have order m x n. 

Since it is rather cumbersome to write out the large rectangular array above each time we wish to discuss the generalized form of a matrix, it is 
common practice to replace the above by A = [cy]. In general, matrices are often given names that are capital letters and the corresponding 
lower case letter is used for individual entries. For example the entry in the third row, second column of a matrix called C would be C32. 
Example 5.1.1. 



are 2 X 2, 3 X 1 , and 3x3 matrices, respectively 

Since we now understand what a matrix looks like, we are in a position to investigate the operations of matrix algebra for which users have 
found the most applications. 



5.1 Basic Definitions 




Example 5.1.2 



First we ask ourselves: Is the matrix A = 




1 



2 



the matrix B = 



3 



5 



? No, they are not because the corresponding entries in the second row, second column of the two matrices are not 




2 3 
5 6 



equal to B 



4 5 



1 2 



? No, although the corresponding entries in the first two columns are identical, B doesn't 
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have a third column to compart to that of A. We formalize these observations in the following definition. 

Definition; Equality of Matrices. A matrix A is said to be equal to matrix B ( written A = B) if and only if: 

(1) A and B have the same order, and 

(2) all corresponding entries are equal: that is, atj = bijfor all appropriate i and j. 

I 5.2 Addition and Scalar Multiplication 

Example 5^.1. Concerning addition, it seems natural that if 

I I \ /'34\ ('1+30 + 4 \ /44\ 

( 1 2 3 \ ( 3 \ 

However, if A = I ^ ^ ^ I ^^d ^ = I 2 g I' ^® A + B? No, the orders of the two matrices must be identical. 

Definition: Matrix Addition. Let A and B be mxn matrices. Then A + B is an mxn matrix where (A + B)jj = ay + bij ( read "the ith 

jth entry of the matrix A + B is obtained by adding the ith jth entry of A to the ith jth entry ofB"). If the orders of A and B are not identical, 
A+B is not defined. 

It should be clear from Example 5.2.1 and the definition of addition that A + B is defined if and only if A and B are of the same order. 

Another frequently used operation is that of multiplying a matrix by a number, commonly called a scalar in this context. Scalars normally come 
from the same set as the entries in a matrix. For example, if A e M„x„(R), a scalar can be any real number. 

/ I -2\ 

Example 5.2.2. If c = 3 and if A = I ^ ^1 and we wish to find cA, it seems natural to multiply each entry of A by 3 so that 

(3 -6 \ 

3 A = I ^ 15 j ' ^"'^ ^^^^ ^'^ precisely the way scalar multiplication is defined. 

Definition: Scalar Multiplication. Let A be an m x n matrix and c a scalar. Then c A is the m xn matrix obtained by multiplying c times 
each entry of A; that is (cA)y = ccy. 



I 5.3 Multiplication of Matrices 



■ For a video introduction to this section, go to http://facuity.umi.edu/i<levasseur/ads2/videos/matrixmuitipiication/ 



A definition that is more awkward to motivate (and we will not attempt to do so here) is the product of two matrices. In time, the reader will see 
that the following definition of the product of matrices will be very useful, and will provide an algebraic system that is quite similar to elemen- 
tary algebra. 

Definition: Matrix Multiplication. Let Abe an m xn matrix and let B be an nxp matrix. The product of A and B, denoted by AB, is an 
mxp matrix whose ith row jth column entry is 

(AB)ij = aiibij +aj2b2j + ■■■ +ainbnj 

n 

for 1 <i <m and 1 < j < p. 

The mechanics of computing one entry in the product of two matrices is illustrated in Figure 5.3.1 . 
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Col. 2 



Row 1-*^ 







-1 2-1 
0-1 I 



-6 
3 



4 
6 

5 



Row I, Col. 2 or Producl 

I 

(IH2) + ( - 1)(3) ^ (0)14) * 



• -1 * 

* * • 



Figure 5.3.1 Computation of one entry in the product of two 3 by 3 matrices 



The computation of a product can take a considerable amount of time in comparison to the time required to add two matrices. Suppose that A 
and B axe nxn matrices; then (AB)y is determined performing n multiplications and n - 1 additions. The full product takes multiplications 

and — additions. This compares with additions for the sum of two nxn matrices. The product of two 10 by 10 matrices will require 
1,000 multiplications and 900 additions, clearly a job that you would assign to a computer. The sum of two matrices requires a more modest 
100 additions. This analysis is based on the assumption that matrix multiplication will be done using the formula that is given in the definition. 
There are more advanced methods that, in theory, reduce operation counts. For example, Strassen's algorithm 

(http://mathworld.wolfram.com/StrassenFormulas.html) computes the product of two n hy n matrices in 7-7'"=2" -6 - 4^°h" » 7^^^°^ opera- 
tions. There are practical issues involved in actually using the algorithm in many situations. For example, round-off error can be more of a 
problem than with the standard formula. 



Example 5.3.1. Let A 



\ 



1 

3 2 
-5 1 



, a 3 X 2 matrix, an let B 



, a 2 X 1 matrix. Then AB is a 3 x 1 matrix: 



f 1 



AB = 



^ 
3 2 
-5 1 



1-6-I-0-1 ^ 




f ^ 1 


2-1 4-3 -6 




20 


-5 -6 M , 




, -29 , 



Remarks: 



(1) The product AB is defined only if A is an mxn matrix and B is an nxp matrix; that is, the two "inner" numbers must be the equal. 
Furthermore, the order of the product matrix A fi is the "outer" numbers, in this case mxp. 

(2) It is wise to first determine the order of a product matrix. For example, if A is a 3x2 matrix and 5 is a 2x2 matrix, then A5 is a 3x2 
matrix of the form 

' cn ci2 

AB = C21 C22 

> C31 C32 ) 

Then to obtain, for example, C31 , we multiply corresponding entries in the third row of A times the first column of B and add the results. 
Example 5.3.2. 
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UtA=(J 3).andB=(^ ° ) . Then 

/- 1-3 + 0-2 l-O + O-l \_ /3 \ 
~[ 0-3 + 3-2 0-0 + 3-1 j~ ie 3 j 



/ 3 \ 
Note: B A =1 3 j ^'^^ 

Remarks; 

(1) An n X n matrix is called a square matrix. 

(2) If A is a square matrix, A A is defined and is denoted by A^ , and AAA = A^ , etc. 

(3) The mxn matrices whose entries are all are denoted by 0„xn, or simply 0, when no confusion arises regarding the order. 

EXERCISES FOR SECTIONS 5.1 THROUGH 5.3 
A Exercises 

'■^'^ = (2 3 )'^=(3 -5)'-'^^i3 -2 2 J 

(a) Compute A B and B A. 

(b) Compute A + B and B + A. 

(c) Ifc = 3, show that c(A + B) = cA + cB. 

(d) Show that (AB)C = A(BC). 

(e) Compute A^C. 

(f) Compute B + 

(g) Compute A 02x2 and 02x2 A> where 02x2 is the 2 x 2 zero matrix, 

(h) Compute A, where is the real number (scalar) zero. 

(i) Letc = 2 and (i = 3.Showthat(c + d) A = cA + dA. 



1 2 \ 
2. Let A = 2-15 

.321, 

Compute, if possible; 

(a) A - B 

(b) AB 

(c) AC - BC 

(d) A(BC) 

(e) CA - CB 



,B = 



/' 2 3 \ 
1 1 2 
-1 3 -2, 



,andC = 



2 1 2 3 \ 
4 11 
3-141, 



(f) C 



z 
w 



(2 0\ 

3. Let A = \q 2 I . Find a matrix B such that A B = /andBA = /, where/ 

4 . Find A / and B / where / is as in Exercise 3 , where 



1 
1 



A = (l l).n6B 
What do you notice? 
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5. Find if A: 



I' 1 
2 
,0 3 ; 



. What is A'' equal to? 



B Exercises 



6. (a) Determine and if / = 



fl 0\ 



1 

Vo 1 J 

(b) Wliat is /" equal to for any n > 1? 

(c) Prove your answer to part (b) by induction. 

2 1 



7. (a) If A = ^ 1 )' ^ ~ ( ;<:' )' ^'^'^ ^ ~ ( 1 ) ' = B is a way of expressing the system 2 xi + X2 = 3 using matrices. 



Xl- X2= 1 



(b) Express the following systems of equations using matrices: 

(i) 2xi - X2 = 4 

Xl + X2 = 

(ii) xi + X2+ 2xi= 1 

Xl + 2X2 - X3 = -1 
Xl + 3 X2 + X3 = 5 

(iii) Xl + X2 =3 

X2 =5 
Xl + 3 X3 = 6 
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I 5.4 Special Types of Matrices 



We have aheady investigated one special type of matrix, namely the zero matrix, and found that it behaves in matrix algebra in an analogous 
fashion to the real number 0; that is, as the additive identity. We will now investigate the properties of a few other special matrices. 

Definition: Diagonal Matrix. A square matrix D is called a diagonal matrix ifdij = whenever i ^ j. 
Example 54.1. 



are all diagonal matrices. 



In Example 5.4.1, the 3 x3 diagonal matrix / whose diagonal entries are all I's has the singular property that for any other 3x3 matrix A we 
have A/ = I A = A. For example: 





f 1 





-) 




(-3 





> 




1 1 





\ 


A = 





2 





, B = 











, and/ = 





1 







.0 





5 , 




.0 





-5 , 




.0 





1 . 



Example 54.2. If A = 



AI 



I A = 



r 1 
6 

.3 

/ 1 
6 

,3 



I 1 2 
6 7 

,3 -3 

5 \ 
-2 




then 



and 



2 5 
7 -2 
-3 



In other words, the matrix / behaves in matrix algebra like the real number 1; that is, as a multiplicative identity. In matrix algebra the matrix / 
is called simply the identity matrix. Convince yourself that if A is any nxn matrix A / = I A = A. 

Definition: Identity Matrix. The nXn diagonal matrix whose diagonal components are all I's is called the identity matrix and is 

denoted by lor l„ . 

In the set of real numbers we reaUze that, given a nonzero real number x, there exists a real number y such that xy = yx = 1 . We know that 

real numbers commute under multiplication so that the two equations can be summarized as xy = 1. Further we know that y = x~' = - . Do 

we have an analogous situation in M„x„(IR)? Can we define the multiplicative inverse of an nXn matrix A? It seems natural to imitate the 
definition of multiplicative inverse in the real numbers. 

Definition: Matrix Inverse. Let Abe an nxn matrix. If there exists an nxn matrix B such that AB = B A = I, then B is the multiplica- 
tive inverse of A (called simply the inverse of A) and is denoted by A"' (read "A inverse"). 

When we are doing computations involving matrices, it would be helpful to know that when we find A"\ the answer we obtain is the only 

inverse of the given matrix. 

Remark: Those unfamiliar with the laws of matrices should go over the proof of Theorem 5.4.1 after they have familiarized themselves with the 
Laws of Matrix Algebra in Section 5 .5 . 

Theorem S.4.1. The inverse of an n Xn matrix A, when it exists, is unique. 

Proof: Let A be an n x n matrix. Assume to the contrary, that A has two (different) inverses, say B and C. Then 



5 = 5/ 
= 5(AC) 

= (BA)C 
= IC 

= c 

Example 5.43. Let A = 



Identity property of / 
Assumption that C is an inverse of A 
Associativity of matrix multiplication 
Assumption that B is an inverse of A 
Identity property of / ■ 



2 
3 



What is A ' ? Without too much difficulty, by trial and error, we determine that A 



^ 
1^ 

3 / 



This 



1 2 
-3 5 



, the "reciprocal rule" would tell us that the inverse of A is 5 = 



. Try computing A 5 and you will see that you don't get 



might lead us to guess that the inverse is found by taking the reciprocal of all nonzero entries of a matrix. Alas, it isn't that easy! If 

MM 

2 

3 5 

the identity matrix. So, what is A~'? In order to understand more completely the notion of the inverse of a matrix, it would be beneficial to 
have a formula that would enable us to compute the inverse of at least a 2x2 matrix. To do this, we need to recall the definition of the determi- 
nant of a 2 X 2 matrix. Appendix A gives a more complete description of the determinant of a 2 x 2 and higher-order matrices. 

( a b \ 

Definition: Determinant of a 2 x2 Matrix. Let ^ = I ^ ^1- '^^^ determinant of A is the number detA = ad - be. 
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In addition to det A, common notation for the determinant of matrix A is | A | . This is particularly common when writing out the whole matrix, 
' a b \ 

for the determinant of the general 2x2 matrix. 



which case we would write 
Example 5AA. 



c d 



If A = 



1 2 
-3 5 



thendetA = 1-5 -2-(-3) = ll. 
thendetB = 1-4 -2-2 = 



I a b\ , \ I d -b\ 

Theorem 542. LetA = \ , . If det A ^0, then A'' =- — 

\C d ) det A \-c a 



' a b 

Proof: See Exercise 4 at the end of this section. 



Example 5 AS. Can we find the inverses of the matrices in Example 5.4.4? 



< 5 


2 ^ 


11 


11 


3 


1 


I. 11 


11 - 



The reader should verify that A A ^ = A ^ A = /. 

The second matrix, B has a determinant equal to zero. We we tried to apply the formula in Theorem 5.4.2, we would be dividing by zero. 
For this reason, the formula can't be applied and in fact B"' does not exist. 

Remarks: 

(1) In general, if A is a 2 x 2 matrix and if det A =0, then A"^ does not exist. 

(2) A formula for the inverse of nXn matrices n > 3 can be derived that also involves det A, Hence, in general, if the determinant of a matrix is 
zero, the matrix does not have an inverse. However the formula for even a 3 x 3 matrix is very long and is not the most efficient way to 
compute the inverse of a matrix. 

(3) In Chapter 12 we wiU develop a technique to compute the inverse of a higher-order matrix, if it exists. 

(4) Matrix inversion comes first in the hierarchy of matrix operations; therefore, A B"' is A(B~') . 

EXERCISES FOR SECTION 5.4 
A Exercises 

1 . For the given matrices A find A~ ' if it exists and verify that AA~^ =A~^ A = I IfA"' does not exist explain why . 

/ 6 -3 \ 

'"""(s -4) 

(d) A 



' 1 
,0 1 



(e) Use the definition of the inverse of a matrix to find A ' : 

/'3 \ 

A= - 

2 

,0 -5 , 

2. For the given matrices A find A~' if it exists and verify that AA'^ =A'^ A = I If A~' does not exist explain why. 

2 -1 



-1 2 



(a) A = 
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■ 1 c 
,0 1 



(c) A = 

I a b \ 

(d) A = I ^ ^1, were a > b>0. 



3. (a) Let A = ^ ^ ^ j and 5 = ^ ^ i ]■ "Verify that (AB)-' = B"' A"' . 

(b) Let A and B be nxn invertible matrices. Prove that (AB)"' = B'^ A~^. Why is the right side of the above statement written "backwards"? 
Is this necessary? Hint: Use Theorem 5.4. L 

B Exercises 

(a b \ _i 
^ ^1. Derive the formula for A \ 

5. (a) Let A and B be as in problem 3 above. Show that det(AB) = (detA) (detB). 

(b) It can be shown that the statement in part (a) is true for all nxn matrices. Let A be any invertible nXn matrix. Prove that 
det(A"') = (detA)"' . Note: The determinant of the identity matrix /„ is 1 for all n, see Appendix A for details, 
(c) Verify that the equation in part (b) is true for the matrix in exercise 1(a) of this section. 



6. Prove by induction that for n > 1, 



a 0\"_la" 
,0 bj ~ [ b" 

7. Use the assumptions in exercise 5 to prove by induction that if n > 1, det(A") = (detA)". 

8. Prove: If the determinant of a matrix A is zero, then A does not have an inverse. Hint: Use the indirect method of proof and exercise 5. 

C Exercise 

9. (a) Let A, B, andD be nxn matrices. Assume that B is invertible. If A = BDB~^ , prove by induction that A™ = Biy" B'^ is true for 
m > 1. 



(b) Given that A = ^^j=B^^ ^ )^"' where B = ^ ^ ^jwhatisA")? 
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I 5.5 Laws of Matrix Algebra 



The following is a summary of the basic laws of matrix operations. Assume that the indicated operations are defined; that is, that the orders of 
the matrices A, B, and C are such that the operations make sense. 

(1) A + B = B + A 

(2) A + (B + C) = (A + B) + C 



(3) c(A + B) = cA + cB, where c e R. 

(4) (ci + C2) A = C[ A +C2A,whereci, C2 e R. 



(5) ci(c2A) = (c] •C2 ) A, where ci , C2 



(6) OA = 0, where is the zero matrix. 



(7) OA = 0, where on the left is the number 0. 

(8) A + = A. 



(9) A + (-l)A = 0. 



(10) A(B + C) = AB + AC. 



(11) (B + C)A = BA + CA. 

(12) A(BC) = {AB)C. 



(13) I A = A and A/ = A. 



(14)IfA-' exists, (A-' )"' = A. 



(15)If A-' andfi-' exist, (AB)"' = B-'A"' 



Example 5.5.1. If we wished to write out each of the above laws more completely, we would specify the orders of the matrices. For 
example. Law 10 should read: 

(10) Let A, B, and C be mxn, nxp, and «x/7 matrices, respectively, then A (S + C) = AB + AC 
Remarks: 

(1) Notice the absence of the "law" AB = BA. Why? 

(2) Is it really necessary to have both a right (No. 1 1) and a left (No. 10) distributive law? Why? 

(3) What does Law 8 define? What does Law 9 define? 

EXERCISES FOR SECTION 5.5 
A Exercises 

1. Rewrite the above laws specifying as in Example 5.5.1 the orders of the matrices. 

2. Verify each of the Laws of Matrix Algebra using examples. 

( I 2 \ (3 1 6\ (0 -2 A\ 

3. Let A = „ i T 5 Land C=\ . Compute the following as efficiently as possible by using any of the Laws of 



.0 -1 j U -1 5 j V7 1 1 

Matrix Algebra: 

(a) AB + AC 

(b) A-' 

(c) A(B + C) 
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(d) (A2)-i 

(e) (C + B)-i A-i 

4. Let = ^ 2 1 ) ^"'^ ^ ~ ( 2 4 ) ■ '-•°™P'^'^® following as efficiently as possible by using any of the Laws of Matrix Algebra: 

(a) AB 

(b) A + B 

(c) + AB + BA + B^ 

(d) B-^A-i 

(e) A^ + AB 

5 . Let A and B be n x n matrices of real numbers .Is A^ - B^ = (A- B)(A + B)l Explain 
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I 5.6 Matrix Oddities 

We have seen that matrix algebra is similar in many ways to elementary algebra. Indeed, if we want to solve the matrix equation AX = B for 
the unknown X, we imitate the procedure used in elementary algebra for solving the equation ax = b. Notice how exactly the same properties 
are used in the following detailed solutions of both equations. 



Solution of ax = 


b 


Solution of A = B 




ax = b 




AX = B 




a~^{a x) = a'^ b 


if a # 


A-\AX) = A-^B 


if A~^ exists 


(a~' a)x = b 


associative law 


(A-' A)Z = A-'B 


associative law 


Ix = a'^ b 


definition of inverse 


IX = A-ifi 


definition of inverse 


X = rr' b 


itlentit\' propert\' (if 1 


X = /\-' B 


itlentit\' prciperty (if / 



Certainly the solution process for AX = B is the same as that of ax = b. 



The solution of xa = bis x = ba ^ = a ^ b.Jn fact, we usually write the solution of both equations as x = - . In matrix algebra, the solution of 

a 

XA = B isX = B A ' , which is not necessarily equal to A ' B. So in matrix algebra, since the commutative law (under multiplication) is not 
true, we have to be more careful in the methods we use to solve equations. 

It is clear from the above that if we wrote the solution of AX = B as X = ^, we would not know how to interpret the answer ^. Does it mean 
A~' B or B A"' ? Because of this, A'^ is never written as - . 

A 

Some of the main dissimilarities between matrix algebra and elementary algebra are that in matrix algebra: 

(1) A B may be different from B A. 

(2) There exist matrices A and B such that A B = , and yet A + and B + . 

(3) There exist matrices A where A i= 0, and yet A^ = 0. 

(4) There exist matrices A where A^ - A with Ai=I and Ai=0 

(5) There exist matrices A where A^ =1 , where A # / and A + -I 

EXERCISES FOR SECTION 5.6 
A Exercises 

1. Discuss each of the above "oddities" with respect to elementary algebra. 

2. Determine 2x2 matrices which show each of the above "oddities" are true. 

B Exercises 

3 . Prove the following implications , if possible: 

(a) A^ = A and det A ^ A = / 

(b) A2 = /anddetA * => A = lor A = -I. 

4. Let M„xn(IR) be the set of real nxn matrices. Let P c M„xn(IR) be the subset of matrices defined by A e Pif and only if A^ = A. Let 
Q c P be defined by A e 2 if and only if det A # 0. 

(a) Determine the cardinality of Q. 

(b) Consider the special case n = 2 and prove that a sufficient condition for A e P c M2x2(IR) is that A has a zero determinant (i.e., A is 
singular) and tr (A) = 1 where tr (A) = an + a 22 is the sum of the main diagonal elements of A. 

(c) Is the condition of part b a necessary condition? 
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C Exercises 

5. Write each of the following systems in the form AX = B, and then solve the systems using matrices. 

(a) 2 xi + X2 = 3 

Xl-X2= 1 

(b) 2 jTi - X2 = 4 

Xi - X2 = 

(C) 2 Xi + X2 = 1 
Xi - X2 = 1 

(d) 2 xi + X2 = 1 

Xi ~ X2 = ~ 1 

(e) 3 xi + 2 X2 = 1 
6xi + 4x2 = -1 

6. Recall that p (x) = x^ - 5 x + 6 is called a polynomial, or more specifically, a polynomial over R, where the coefficients are elements of R 
and X e R. Also, think of the method of solving, and solutions of, x^ - 5 x + 6 = 0. We would like to define the analogous situation for 2x2 
matrices. First define where A is a 2x2 matrix p(A) =A^-5A + 6I. Discuss the method of solving and the solutions of 
A^ - 5A + 67 = 0. 

7. (For those who know calculus) 

(a) Write the series expansion for e" centered around a = 0. 

(b) Use the idea of exercise 6 to write what would be a plausible definion of c'* where A is an n x n matrix. 

MM -M A ( e e-l \ B / 1 -M 

(c) If A = I p pi and B = I q I > use the series m part (b) to show that * = I q ^ I * = I q j I- 

(d) Show that + 

(e) Show that c^+* = ( o 1 ) 

(f) Is e^e^ = e^*^l 
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I SUPPLEMENTARY EXERCISES FOR CHAPTER 5 



Sections 5.1 through 5.3 

1 . Detemiine x and y in the following: 





' 1 





-1 ^ 




fO 





2 - 






2. LetA = 


2 


1 


5 


, B = 


3 


-1 


1 


,c = 


4 




.3 


-4 


1 . 




.1 


2 


-1 . 




,5 , 



. Compute: 



(a) 2A-3S 

(b) 2A-5A 

(c) AC + BC 

3 . Let A and B be two m X m matrices with AB = BA. Prove by induction on n that AB" =B" A for n greater than or equal to 1 . 

4. Prove by induction that if n is a positive integer, and 





( 1 


1 


> 




■ 1 


n n(n - l)/2 ^ 


A = 





1 


1 


, then A" = 





1 n 




.0 





1 , 




.0 


1 



Section 5.4 

5. Determine A"' A' if A : 



2 3 
1 4 



I 4 -2\ 12 0\ 

6.UtA = (_2 5 )andB = (j J 



Compute A + B, A^ + AB +BA + B^, and B ' A ^ . You may save some time by thinking before plunging into the computations. 
Dr wl 



7. For what real number c wiU the matrix D have no inverse? Explain your answer. 

'3 15 
.4 c 



8. LetP: 



;M2x2(R) |arf*fc}. 



Fact: The inverse of a diagonal matrix belonging to P can be found simply by reciprocating the diagonal elements of the matrix 

3 

.0 6 

' a b 



(a) Determine j 

I a b \ I a b I l/a b \ I a b \ 

(b) Suppose ^ j e P and ^ j =y ^ l/d }' ^ general, ^^y^ d j ^ diagonal matrix? If yes, explain why; if no, give the most 

general form of such a matrix ^ 

Section 5.5 

9. (a) Let A and Bb&nxn matrices. Expand (A + B)^. 
(b) Is (A + B)2 ever equal to A^ + 2 AB + B^? Explain. 

10. Solve the following matrix equation for X. Be careful to explain under which conditions each step is possible. 

AX + C = BX 

Section 5.6 

1 1 . Prove or disprove: A"' = A and B ' = B => (AB)"' = AB. 

12. The following is true for all real numbers a and b: a-b = if and only if a = or = 0. Is any part of this statement true for n x n matrices A 
and B? Explain. Give an example and proof. 



13. Let A : 



where a, b, c, d eM.. Show that the matrices of the form A : 



1 

c -1 



and A : 



1 b 
-1 



are also solutions to the 



equation A^ = /, confirming that a quadratic matrix equation can have an infinite number of solutions. Are there any others? 
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chapter 6 

RELATIONS AND GRAPHS 




GOALS 

One understands a set of objects completely only if the structure of that set is made clear by the interrelationships between its elements. For 
example, the individuals in a crowd can be compared by height, by age, or through any number of other criteria. In mathematics, such compar- 
isons are called relations. The goal of this chapter is to develop the language, tools, and concepts of relations. 

I 6.1 Basic Definitions 

In Chapter 1 we introduced the concept of the Cartesian product of sets. Let's assume that a person owns three shirts and two pairs of slacks. 
More precisely, let A = {blue shirt, tan shirt, mint green shirt) and B = {grey slacks, tan slacks). Then certainly A x B is the set of all possible 
combinations (six) of shirts and slacks that the individual can wear. However, the individual may wish to restrict himself or herself to combina- 
tions which are color coordinated, or "related." This may not be all possible pairs in AxB but will certainly be a subset of AxB. For example, 
one such subset may be I (blue shirt, grey slacks), (blue shirt, tan slacks), (mint green shirt, tan slacks)). 

Definition: Relation. Let A and B be sets. A relation from A into B is any subset of AxB. 

Example 6.1.1. Let A = (1, 2, 3) and 5 = |4, 5).Then|(l, 4), (2, 4), (3, 5)) is a relation from A into S. Of course, there are many 
others we could describe; 64, to be exact. 

Example 6.1.2. Let A = |2, 3, 5, 6) and define a relation r from A into A by (a, b) e r if and only if a divides evenly into b. The set of 
pairs that qualify for membership is r = |(2, 2), (3, 3), (5, 5), (6, 6), (2, 6), (3, 6)). 

Definition: Relation on a Set. A relation from a set A into itself is called a relation on A. 
The relation "divides" in Example 6.1 .2 is will appear throughout the book. Here is a general definition on the whole set of integers. 

Definition: Divides. Let a, b e Z. 

a \ b if and only if there exists an integer k such that a k = b. 

Be very careful in writing about "divides." The vertical line symbol use for this relation, if written carelessly, can look like division. While a | b is either true or false, 
a/ b \s a number. 

Based on the equation ak = fo, we can say that a\bis equivalent to k = -, or a divides evenly into b. In fact the "divides" is short for "divides 
evenly into." You might find the equation k = - initially easier to understand, but in the long run we will find the equation ak = b more 
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convenient. 

Sometimes it is helpful to illustrate a relation with a graph. Consider Example 6.1.1. A graph of r can be drawn as in Figure 6.1.1. The arrows 
indicate that 1 is related to 4 under ;-. Also, 2 is related to 4 under r, and 3 is related to 5, while the upper arrow denotes that r is a relation from 
the whole set A into the set B. 




FIGURE 6.1 .1 A graph of a relation 



A typical element in a relation r is an ordered pair (x, y). In some cases, r can be described by actually listing the pairs which are in r, as in the 
previous examples. This may not be convenient if r is relatively large. Other notations are used with certain well-known relations. Consider the 
"less than on equal" relation on the real numbers. We could define it as a set of ordered pairs this way: 

s = {{x, y) \ X < y]. 

The notation x < yis clear and self-explanatory; it is a more natural, and hence preferred, notation to use than (x, y) e s. 

Many of the relations we will work with "resemble" the relation <, so x ^ j is a common way to express the fact that x is related to y through the 
relation s. 

Relation Notion. Let s be a relation from a set A into a set B. Then the fact that (x, y) £ s is frequently written xsy. 

Let A = |2, 3, 5, 81, B = (4, 6, 16), and C = |1, 4, 5, 71; let r be the relation "divides," denoted by |, from A into B; and let s be the 
relation < from B into C. So r = ((2, 4), (2, 6), (2, 16), (3, 6), (8, 16)| and i = ((4, 4), (4, 5), (4, 7), (6, 7)|. 

Notice from Figure 6.1 .2 that we can, for certain elements of A, go through elements in B to results in C. That is: 

2 I 4and4 < 4 
2 I 4and4 < 5 
2 I 4and4 < 7 

2 I 6and6 <7 

3 I 6and6 < 7 

Based on this observation, we can define a new relation, call it rs, from A into C. In order for (a, c) to be in rs, it must be possible to travel 
along a path in Figure 6.1 .2 from a to c. In other words, (a, c) e rs if and only if (3 b)^ (arb and b s c). The name rs was chosen because it 
reminds us that this new relation was formed by the two previous relations r and s. The complete listing of all elements in rs is 
1(2, 4), (2, 5), (2, 7), (3, 7)). We summarize in a definition. 
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A ► B ► C 





r 




s 




2 




1 

Q 


\ 

3 




4 




4 






o^- 




»-0 


5 

o 




6 




■^^^ 5 




8 




\ 16 




7 


e 




»-e 




o 



FIGURE 6.1.2 Graphical representation of composition of relations 



Definition: Composition of Relations. Let r be a relation from a set A into a set B, and let s be a relation from B into a set C. The 
composition of r with s, written rs, is the set of pairs of the form (a, c) e AxC, where {a, c) Ers if and only if there exists b eB such that 
(a, b) Erand(b, c) E s. 

Remark: A word of warning to those readers familiar with composition of functions. (For those who are not, disregard this remark. It will be 
repeated at an appropriate place in Chapter 7.) As indicated above, the traditional way of describing a composition of two relations is rs where r 
is the first relation and i the second. However, function composition is traditionally expressed "backwards"; that is, s°r, where r is the first 
function and s is the second. 

EXERCISES FOR SECTION 6.1 
A Exercises 

1 . For each of the following relations r defined on P, determine which of the given ordered pairs belong to r. 

(a) xrymx\y- (2, 3), (2, 4), (2, 8), (2, 17) 

(b) xryiiix < y; (2, 3), (3, 2), (2, 4), (5, 8) 

(c) xryiiiy = x^ ; (1,1), (2, 3), (2, 4), (2, 6) 

2 . The following relations are on { 1 , 3 , 5 } . Let r be the relation xryiiiy = x + 2 and s the relation xsy iiix < y. 

(a) Findri. 

(b) Find sr. 

(c) Illustrate rs and sr via a diagram. 

(d) Is the relation (set) rs equal to the relation srl Why? 

3. Let A = |1, 2, 3, 4 , 5| and define r on Aby xry \ii x + 1 = y. We 

define r^ = rr and r^ = r^ r. Find: 

(a) r 

(b) r^ 

(c) r^ 

4. Given s and f, relations on Z, i = {(1, n) : « e Z) and / = ((«, 1) : « e Z), what are st and /^? Hint: Even when a relation involves 
infinite sets, you can often get insights into them by drawing partial graphs. 

B Exercises 

5 . Let p be the relation on the power set , ?'(5 ) , of a finite set S of cardinality n . Define p by (A , B) e p iff A f) B = . 
(a) Consider the specific case n = 3, and determine the cardinality of the set p. 
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(b) What is the cardinality of p for an arbitrary n? Express your answer in terms of n. (Hint: There are three places that each element of S can 
go in building an element of p.) 

6. Let ri,r2, and rs be relations on any set A. Prove that if ri c rithen ri c^r^r^. 
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I 6.2 Graphs of Relations on a Set 

In this section we introduce directed graphs as a way to visualize relations on a set. 

Example 6.2.1, Let A = {0, 1, 2, 3}, and let 

r = {(0, 0), (0, 3), (1, 2), (2, 1), (3, 2), (2, 0)}. 

The elements of A are called the vertices of the graph. They are represented by labeled points or occasionally by small circles. We connect 
vertex a to vertex b with an arrow, called an edge, going from vertex a to vertex b if and only i( arb. This type of graph of a relation r is 
called a directed graph or digraph. Figure 6.2.1 is a digraph for r. Notice that since 1 r 2 and 2 r 1, we draw a single edge between 1 and 2 
with arrows in both directions. Since is related to itself, we draw a "self-loop" at 0. 



FIGURE 6.2.1 

The actual location of the vertices is immaterial. The main idea is to place the vertices in such a way that the graph is easy to read. Obviously, 
after a rough-draft graph of a relation, we may decide to relocate the vertices so that the final result wiU be neater. Figure 6.2.1 could also be 
presented as in Figure 6.2.2. 




FIGURE 6.2.2 



A vertex of a graph is also called a node, point, or a junction. An edge of a graph is also referred to as an arc, a line, or a branch. Do not be 
concerned if two graphs of a given relation look different. 

Example 6.2.2. Consider the relation s whose digraph is Figure 6.2.3. What information does this give us? The graph tells us that j is a 
relation on A = {1, 2, 3} and that 

s = {(1, 2), (2, 1), (1, 3), (3, 1), (2, 3), (3, 3)), 
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FIGURE 6.2.3 

Example 6^3. Let B = {1, 2),andlet A = P(B) = {0, (1), (2), {1, 2)). Then c is a relation on A whose digraph is Figure 6.2.4. 




FIGURE 6.2.4 

We will see in the next section that since c has certain structural properties that describe "partial orderings." We will be able to draw a much 
simpler type graph than this one, but for now the graph above serves our purposes. 

EXERCISES FOR SECTION 6.2 

A Exercises 

1. Let A = |1, 2, 3, 4), and let r be the relation < on A. Draw a digraph forr. 

2. Letfi = (1,2, 3, 4, 6, 8, 12, 24), and let .s be the relation "divides," on B. Draw a digraph fori. 

3. Let A = {1 , 2, 3, 4, 5). Define t on Aby at b if and only if b - a is even. Draw a digraph for t. 

4. (a) Let A be the set of strings of O's and I's of length 3 or less. Define the relation of on A by xdy if x is contained within y. For example, 
01 101 . Draw a digraph for this relation. 

(b) Do the same for the relation defined by x p y if x is a prefix of y. For example, 10 p 101 , but 01 p 101 is false. 

B Exercises 

5. Recall the relation in Exercise 5 of Section 6.1 , p defined on the power set, P(S), of a set S. The definition was (A, B) e p iff A p] S = 0. 
Draw the digraph for p where 5 = {a, b). 

6. LetC= {1,2, 3, 4, 6, 8, 12, 24) and define f on C by 

atb if and only if a and b share a common divisor greater than 1. 
Draw a digraph for t. 
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I 6.3 Properties of Relations 

Consider the set B = {1, 2, 3, 4, 6, 12, 36, 48} and the relations "divides" and < on B. We notice that these two relations on B have three 
properties in common: 

(1) Every element in B divides itself and is less than or equal to itself. This is called the reflexive property. 

(2) If we search for two elements from B where the first divides the second and the second divides the first, then we are forced to choose the 
the two numbers to be the same. In other words, no two different numbers are related in both directions. The reader can verify that a similar 
fact is true for the relation < on B. This is called the antisymmetric property, 

(3) Next if we choose three numbers from B such that the first divides the second and the second divides the third, then we always find that the 
first number to divides the third. Again, the same is true if we replace "divides" with "is less than or equal to." This is called the transitive 
property. 

Relations that satisfy these properties are of special interest to us. Formal definitions of the properties follow. 

Definition: Refiexive Relation. Let Abe a set and let rbea relation on A. 
r is reflexive if and only ifar a for all a e A. 

Definition: Antisymmetric Relation. Let Abe a set and let rbea relation on A. 

r is antisymmetric if and only if whenever arb and a ^ b then bra is false. 

An equivalent condition for antisymmetry is that whenever arb and bra then a = b. You are encouraged to convince yourself that this is 
the case. 

A word of warning about antisymmetry: Students frequently find it difficult to understand this definition. Keep in mind that this term is defined 
through an "If . .. then . . ." statement. The question that you must ask is: Is it true that whenever there are elements a and b from A where arb 
and a + b,it follows that b is not related to a? If so, then the relation r is antisymmetric. 

Another way to determine whether a relation is antisymmetric is to examine its digraph. The relation is not antisymmetric if there exists a pair 
of vertices that are connected by edges in both directions. 

Definition: Transitive Relation. Let A be a set and let rbea relation on A. 

r is transitive if and only if whenever arb and brc then arc. 

Partial Orderings 

Not all relations have all three of the properties discussed above. Those that do, are a special type of relation. 

Definition: Partial Ordering, Poset. A relation on a set A that is reflexive, antisymmetric, and transitive is called a partial ordering on 
A. A set on which there is a partial ordering relation defined is called a partially ordered set or poset. 

Example 63.1. Let A be a set. Then f (A) together with the relation c (set containment) is a poset. To prove this we observe that the 

three properties hold, as discussed in Chapter 4. 

(1) Let B e f(A). The fact that B c B follows from the definition of subset. Hence, set containment is reflexive. 

(2) Let B] , B2 e y (A) and assume that Bi c B2 and B] t B2 . Could it be that B2 c Bj? No. There must be some element a e A such that 
a ^ B], but a e B2. This is exactly what we need to conclude that B2 is not contained in Bi . Hence, set containment is antisymmetric 

(3) Let Bi, B2, B3 e f (A) and assume that Bi c B2 and B2 c B3 . Does it follow that Bi c B3 ? Yes, if a e Bi, then a e B2 because 
Bi c B2. Now that we have a e B2 and we have assumed B2 c B3, we conclude that a e B3. Therefore, Bi c B3 and so set containment is 
transitive. 

Figure 6.3.1 is the graph for the "set containment" relation on {1 , 2). 
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FIGURE 6.3.1 



This graph is helpful insofar as it reminds us that each set is a subset of itself (How?) and shows us at a glance the relationship between the 
various subsets in P ((1, 2)). However, when a relation is a partial ordering, we can streamline a graph like this one. The streamlined form of 
a graph is called a Hasse diagram or ordering diagram. A Hasse diagram takes into account the following facts. 

(1) By the reflexive property, each vertex must be related to itself, so the arrows from a vertex to itself (called "self-loops") are not drawn in a 
Hasse diagram. They are simply assumed. 

(2) By the antisymmetry property, connections between two distinct elements in a directed graph can only go one way, if at all. When there is 
a connection, we agree to always place the second element above the first (as we do above with the connection from |1) to |1, 2)). For this 
reason, we can just draw a connection without an arrow, just a line. 

(3) By the transitive property, if there are edges connecting one element up to a second element and the second element up to a third element, 
then there will be a direct connection from the first to the third. We see this in Figure 6.3.1 with connected to |1) and then |1) connected to 
|1, 2). Notice the edge connecting to (1, 2). Whenever we identify this situation, remove the connection from the first to the third in a Hasse 
diagram and simply observe that an upward path of any length implies that the lower element is related to the upper one. 

Using these observations as a guide, we can draw a Hasse diagram for c on { 1 , 2) as in Figure 6.3.2. 




FIGURE 6.3.2 
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Example 63.2. Consider the partial ordering relation s whose Hasse diagram is Figure 6.3.3. 




FIGURE 6.3.3 

How do we read this diagram? What is A? What is si What does the digraph of s look like? Certainly A = {1, 2, 3, 4, 5) and 1 s2, 3s4, 
1 .s 4, 1 .s 5, etc., Notice that 1 ^ 5 is implied by the fact that there is a path of length three upward from 1 to 5. This follows from the edges 
that are shown and the transitive property that is presumed in a poset. Since 1^3 and 3 .v 4, we know that 1 j 4. We then combine 1 j 4 with 
4 i 5 to infer I s5. Without going into details why, here is a complete list of pairs defined by s. 

s = ((1, 1), (2, 2), (3, 3), (4, 4), (5, 5), (1, 3), (1,4), (1, 5), (1, 2), (3, 4), (3, 5), (4, 5), (2, 5)) 
A digraph for s is Figure 6.3.4. It is certainly more complicated to read and difficult to draw than the Hasse diagram. 




FIGURE 6.3.4 Digraph of Example 6,3.2 

A classic example of a partial ordering relation is < on the real numbers, R. Indeed, when graphing partial ordering relations, it is natural to 
"plot" the elements from the given poset starting with the "least" element to the "greatest" and to use terms like "least," "greatest," etc. Because 
of this the reader should be forewarned that some texts use the symbol < for arbitrary partial orderings. This can be quite confusing for the 

novice, so we continue to use generic letters r, s, etc. 

Equivalence Relations 

Another common property of relations is symmetry. 

Definition: Symmetry. Let r he a relation on a set A.r is symmetric if and only if whenever arb, it follows that bra. 

Consider the relation of equality (=) defined on any set A. Certainly a = b impUes that 6 = a so equality is a symmetric relation on A. 

Surprisingly, equaUty is also an antisymmetric relation on A. This is due to the fact that the condition that defines the antisymmetry property, 
a = b and a t b,is a contradiction. Remember, a conditional proposition is always true when the condition is false. So a relation can be both 
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symmetric and antisymmetric on a set! Again recall that these terms are not negatives of one other. That said, there are very few relations that 
are both symmetric and antisymmetric. 

Definition: Equivalence Relation. A relation r on a set A is called an equivalence relation if and only if it is reflexive, symmetric, and 
transitive. 

The classic example of an equivalence relation is equality on a set A. In fact, the term equivalence relation is used because those relations which 
satisfy the definition behave quite like the equality relation. Here is another important equivalence relation. 

Example 6.3.3. Let Z* be the set of nonzero integers. One of the most basic equivalence relations in mathematics is the relation q on 
Z xZ* defined by (a, b) q (c, d) if and only Had = be. We will leave it to the reader to, verify that q is indeed an equivalence relation. Be 
aware that since the elements of ZxZ*are ordered pairs, proving symmetry involves four numbers and transitivity involves six numbers. 
Two ordered pairs, (a, b) and (c, d), are related if the fractions ^ and ^ are numerically equal. 

FIGURE 6.3.5. 

Example 634. Let m be a positive integer, m > 2. We define congruence modulo m to be the relation =„ defined on the integers by 
a =mb a and only if m \ (a- b) 

(1) This relation is reflexive, for if a e Z, m | (a - a) => a =^ a. 

(2) This relation is symmetric. We can prove this through the following chain of implications. 

a =mb => m\ {a-b) 

=>a - b = mk for some A: e Z 
=^ b - a = m{-k) 

m\{b - a) 
^ b =„a 

(3) Finally, this relation is transitive. We leave it to the reader to prove that if a =„ 6 and b =„ c, then a =„ c. 

On occasion, you will see the equivalent notation asb (mod m) for this relation. 

Example 6.3.5. Consider the relation s described by the digraph in Figure 6.3.5. 
This relation is reflexive (Why?) 
It is not symmetric (Why?) 
It is not transitive (Why?) 

Is J an equivalence relation or a partial ordering? It is neither, and among the vaUd reasons why is that s is not transitive. 




EXERCISES FOR SECTION 6.3 
A Exercises 

1. (a) Let B = {a, b) and U = P(B). Draw a Hasse diagram for c on U. 

(b) Let A = {1,2, 3, 6}. Show that divides, I, is a partial ordering on A. 

(c) Draw a Hasse diagram for divides on A. 

(d) Compare the graphs of parts a and c. 

2. Repeat Exercise 1 with B = {a, b, c] md A= {1,2, 3, 5, 6, 10, 15, 30} . 

3. (a) Consider the relations defined by the digraphs in Figure 6.3.6. Determine whether the given relations are reflexive, symmetric, antisym- 
metric, or transitive. Try to develop procedures for determining the validity of these properties from the graphs, 

(b) Which of the graphs in Figure 6.3.6 are of equivalence relations or of partial orderings? 



Applied Discrete Structures by A. Doerr & K. Levasseur is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States 



116 



Chapter 6 - Relations 



(i) (ii) 




Figure 6.3.6 

4. Determine which of the following are equivalence relations and/or partial ordering relations for the given sets: 

(a) A = {lines in the plane}; xryii and only if x is parallel to y. 

(b) A = R; X r J if and only if | x — j | < 7. 

5. Consider the following relation on 1 1 , 2, 3, 4, 5, 6). r = \i - j\ = 2). 

(a) Is r reflexive? 

(b) Is r symmetric? 

(c) Is r transitive? 
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(d) Draw a graph of r. 

6. For the set of cities on a map, consider the relation xry ii and only if city x is connected by a road to city y. A city is considered to be 
connected to itself, and two cities are connected even though there are cities on the road between them. Is this an equivalence relation or a 

partial ordering? Explain. 

7. Let A = |0, 1, 2, 3) and let 

r = {(0, 0), (1, 1), (2, 2), (3, 3), (1, 2), (2, 1), (3, 2), (2, 3), (3, 1), (1, 3)}. 

(a) Show that r is an equivalence relation on A. 

(b) Let a e A and define c (a) = [b e A | arh}. c(a) is called the equivalence class of a under r. Find c (a) for each element a e A. 

(c) Show that [c (a) | a e A) forms a partition of A for this set A. 

(d) Let r be an equivalence relation on an arbitrary set A. Prove that the set of all equivalence classes under r constitutes a partition of 
A. 

8. Define r on the power set of {1, 2, 3} by ArB <=> | A | = \ B \ . Prove that r is an equivalence relation. What are the equivalence 
classes under r? 

9. Consider the following relations on Zg = {0, 1, . . . , 7}. Which are equivalence relations? For the equivalence relations, Ust the equiva- 
lence classes. 

(a) arb iff the English spellings of a and b begin with the same letter. 

(b) ash iff a - b is, sl positive integer. 

(c) atb iff fl - is an even integer. 

10. (a) Prove that conguence modulo m, introduced in Example 6.3.4, is a transitive. 

(b) What are the equivalence classes under conguence modulo 2? 

(c) What are the equivalence classes under conguence modulo 10? 

B Exercises 

11. In this exercise, we prove that impUcation is a partial ordering. Let A be any set of propositions. 

(a) Verify that q ^ ^ is a tautology, thereby showing that is a reflexive relation on A. 

(b) Prove that => is antisymmetric on A. Note: we do not use = when speaking of propositions, but rather equivalence, 

(c) Prove that ^ is transitive on A. 

(d) Given that is the proposition n < / on I^, draw the Hasse diagram for the relation on {^i, q^, qs, ...}. 

C Exercise 

12. Let S = {1, 2, 3, 4, 5, 6, 7) be a poset (5, <) with the Hasse diagram shown in Figure 6.3.7. Another relation r c 5x5 is defined as 
follows: (x, j) e r if and only if there exists z e 5 such that z < x and z < y in the poset (5, <). 

(a) Prove that r is reflexive. 

(b) Prove that r is symmetric. 

(c) A compatible with respect to relation r is any subset Q of set 5 such that x & Q andy e g (jr, y) e r. A compatible g is a 
maximal compatible if Q is not a proper subset of another compatible. Give all maximal compatibles with respect to relation r defined 
above. 
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(d) Discuss a characterization of the set of maximal compatibles for relation r when (S, <) is a general finite poset. What conditions, 
if any, on a general finite poset (5, <) will make r an equivalence relation? 




FIGURE 6.3.7 
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I 6.4 Matrices of Relations 



We have discussed two of the many possible ways of representing a relation, namely as a digraph or as a set of ordered pairs. In this section we 
win discuss the representation of relations by matrices and some of its applications. 

Definition: Adjacency Matrix. Let A = {oj, 02, ■ ■ ., a^] and B = {bj, b2, ■ ■ ., b„] be finite sets of cardinality m and n, respectively. Let r 
be a relation from A into B. Then r can be represented by themxn matrix R defined by 

^ ^fl if airbj 
'■' otherwise 

R is called the adjacency matrix (or the Boolean matrix, or the relation matrix) ofr. 

Example 6.4.1. Let A = (2, 5, 6) and let r be the relation |(2, 2), (2, 5), (5, 6), (6, 6)) on A. Since r is a relation from A into the 
same set A (the B of the definition), we have ai = 2, 02 = 5, and 03 = 6, and bi = 2,b2 = 5, and b^ = 6. Next, since 

2r2,wehavei?ii = 1; 



2r5,wehavei?i2 = 1 

5 r 6, we have R23 = 1 

6 r 6, we have R33 = 1 
All other entries of R are zero, so 



and 



n 1 ^ 



R 



1 
1.0 U 

From the definition of r and of composition, we note that 

r^ = {(2, 2) (2, 5) (2, 6) (5, 6) (6, 6)}, 
The adjacency matrix of r^ is 
1 1 1 



R^ 



1 

lO 1 J 



We do not write R^ only for notational purposes. In fact, R^ can be obtained from the matrix product RR; however, we must use a slightly 
different form of arithmetic. 

Definition: Boolean Arithmetic. Boolean arithmetic is the arithmetic defined on {0, 1} using Boolean addition and Boolean multiplica- 
tion, defined as: 

0+0=0 0+1=1+0=1 1+1=1 
0-0=0 • 1 = 1 = 11 = 1. 

Notice that from Chapter 3, this is the "arithmetic of logic," where + replaces "or" and ■ replaces "and." 
Example 6A2. 



If R - 



10 
10 10 

10 1 
10 1 J 



and S 



Then using Boolean arithmetic, R S = 



ro 1 1 1 
11 

1 
10 J 

(0 1 1 ^ 

111 

11 
1,0 I ) 



and SR -■ 



/- 1 1 1 M 
111 
10 

10 ; 

Theorem 6.4.1. Let A\, A2, and Aj be finite sets where rj is a relation from A\ into A2 and r2 is a relation from A2 into Aj. If Ri and R2 
are the adjacency matrices of rj and r2 , respectively, then the product Rj R2 using Boolean arithmetic is the adjacency matrix of the 
composition rj rj. 

Remark: A convenient help in constructing the adjacency matrix of a relation from a set A into a set B is to write the elements from A in a 
column preceding the first column of the adjacency matrix, and the elements of B in a row above the first row. Initially, if in Example 6.4.1 
would be 
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2 5 6 
2 / ^ 
5 

6 I 

and Rij is 1 if and only if (a,-, bjj e r. So that, since the pair (2, 5) e r, the entry of if corresponding to the row labeled 2 and the column labeled 
5 in the matrix is a 1 . 

Example 643, This final example gives an insight into how relational data base programs can systematically answer questions pertain- 
ing to large masses of information. Matrices R (on the left) and S (on the right) define the relations r and s where arbH software a can be run 
with operating system b, and bscit operating system b can run on computer c. 





OSl 


OS2 


OS3 


0S4 




CI 


C2 


C3 


PI 




f 1 





1 


^ 




OSl 


f 1 


1 


\ 


P2 




1 


1 










0S2 





1 





P3 













1 




OS3 








1 


P4 




, 





1 


1 , 




OS4 


, 


1 


1 , 



Although the relation between the software and computers is not implicit from the data given, we can easily compute this information. The 
matrix of rs is RS, which is 



PI 

P2 
P3 
P4 



CI C2 C3 

(I 1 I \ 

1 1 

1 1 1 

I 1 1 



This matrix tells us at a glance which software will run on the computers listed. In this case, all software will run on all computers with the 
exception of program P2 , which will not run on the computer C3 , and program P4 , which will not run on the computer C 1 . 

EXERCISES FOR SECTION 6.4 
A Exercises 

1. Let Ai = (1, 2, 3, 4), A2 = (4, 5, 6|, and A3 = (6, 7, 8). Let ri be the relation from Ai into A2 defined by ri = [(x, y) \ y - x = 2), 
and let r2 be the relation from A2 into A3 defined by r2 = {(x, y)\ y — x = 1}. 

(a) Determine the adjacency matrices of ri and ^2 . 

(b) Use the definition of composition to find ri r2 . 

(c) Verify the result in part by finding the product of the adjacency matrices of ri and r2 . 

2. (a) Determine the adjacency matrix of each relation given via the digraphs in Exercise 3 of Section 6.3. 

(b) Using the matrices found in part (a) above, find r- of each relation in Exercise 3 of Section 6.3. 

(c) Find the digraph of directly fi-om the given digraph and compare your results with those of part (b). 

3. Suppose that the matrices in Example 6.4.2 are relations on {1, 2, 3, 4}. What relations do if and S describe? 

4. Let D be the set of weekdays, Monday through Friday, let W be a set of employees {1, 2, 3) of a tutoring center, and let V be a set of 
computer languages for which tutoring is offered, {A(PL), B(asic), C(++), J(ava), L(isp), P(ython)}. We define s (schedule) fi-om D into Why 
d J w if w is scheduled to work on day d. We also define r fi-om W into V by w r / if w can tutor students in language l.H s and r are defined by 
matrices 



and if: 



(a) compute S R using Boolean arithmetic and give an interpretation of the relation it defines, and 

(b) compute S R using regular arithmetic and give an interpretation of the result describes. 

5. How many different reflexive, symmetric relations are there on a set with three elements? (Hint: Consider the possible matrices.) 

6. Let A = [a, b, c, d). Let r be the relation on A with adjacency matrix 





1 


2 


3 


M 


( 1 





1 ^ 


T 





1 


1 


W 


1 





1 


Th 





1 





F 


, 1 


1 


. 





A 


B 


C 


J 


L 


p 


1 


fO 


1 


1 








1 ^ 


2 


1 


1 





1 





1 


3 


,0 


1 








1 


1 . 
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abed 
a n 
b 10 
c 1110 
c U 1 1 J 

(a) Explain why r is a partial ordering on A. 

(b) Draw its Hasse diagram. 

7. Define relations^ and ^ on {1, 2, 3, 4} by p = {(a, b) : | a-b\ = 1} and q = {(a,b) \ a-foiseven} 

(a) Represent p and q as both graphs and matrices. 

(b) Determine p q, p^, and q^; and represent them clearly in any way. 

B Exercises 

8. (a) Prove that if r is a transitive relation on a set A, then c r. 

(b) Find an example of a transitive relation for which + 

9. We define < on the set of all nxn relation matrices by the rule that if R and S are any two nxn relation matrices, /? < 5 if and only if 
< Sjj for all 1 < j < n. 

(a) Prove that < is a partial ordering on all n x n relation matrices. 

(b) Prove that if < S ^ if^ < , but the converse is not true. 

(c) If if and 5 are matrices of equivalence relations and if < S, how are the equivalence classes defined by if related to the equivalence classes 
defined by S? 
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I 6.5 Closure Operations on Relations 



In Section 6.1, we studied relations and one important operation on relations, namely composition. This operation enables us to generate new 
relations from previously known relations. In Section 6.3, we discussed some key properties of relations. We now wish to consider the situation 
of constructing a new relation from a previously known relation r where, first, contains r and, second, r"*" satisfies the transitive property. 

Consider a telephone network in which the main office a is connected to, and can communicate to, individuals h and c. Both h and c can 
communicate to another person, d; however, the main office cannot communicate with d. Assume communication is only one way, as indicated. 
This situation can be described by the relation r = {(a, b), (a, c), (h, d), (c, li)). We would like to change the system so that the main office 
a can communicate with person d and still maintain the previous system. We, of course, want the most economical system. 

This can be rephrased as follows; Find the smallest relation r* which contains r as a subset and which is transitive; 

r+ = {(a, h), (a, c), (b, d), (c, d), (a, d)]. 

Definition: Transitive Closure. Let A be a set and r be a relation on A. The transitive closure of r, denoted by , is the smallest 
transitive relation that contains ras a subset. 

Example 6.5.1. Let A = {1, 2, 3, 4} , and let >S = {(1, 2), (2, 3), (3, 4)) be a relation on A. This relation is called the successor 
relation on A since each element is related to its successor. How do we compute S'^ ? By inspection we note that (1, 3) must be in .S"*" . Let's 
analyze why. This is so because (1 , 2) e S and (2, 3) e S, and the transitive property forces (1 , 3) to be in S*. 

In general, it follows that if (a, b) eS and (b, c) e S, then (a, c) e<S + . This condition is exactly the membership requirement for the pair 
(a, c) to be in the composition SS = S'.So every element in must be an element in . So far, .S* contains at least .S U >5^ • Iti particu- 
lar, for this example, since <S = ((1, 2), (2, 3), (3, 4)|and52 = {(1, 3), (2, 4)), we have 

S\J = ((1, 2), (2, 3), (3, 4), (1, 3), (2, 4)). 

Is the relation .S U >5'^ transitive? Again, by inspection, (1 , 4) is not an element of 5 U -5^ . but it must be an element of since (1,3) and 
(3, 4) are required to be in S* . From above, (1, 3) eS^ and (3, 4) e <S. Therefore, the composite S^S = produces (1, 4). This shows 
that Q . This process must be continued until the resulting relation is transitive. If A is finite, as is true in this example, the transitive 
closure will be obtained in a finite number of steps. For this example, 

5+ =5U.S2U S^ = {(1, 2), (2, 3), (3, 4), (1, 3), (2, 4), (1,4)). 

Theorem 6.5.1. Ifr is a relation on a set A and \A\ = n, then the transitive closure ofr is the union of the first n powers ofr. That is, 

r+ = r U U '•^ U ••• U r". 
Let's now consider the matrix analogue of the transitive closure. 

Example 6.5.2. Consider the relation 

r = {(1, 4), (2, 1), (2, 2), (2, 3), (3, 2), (4, 3), (4, 5), (5, 1)} 

on the set A = {1,2, 3, 4, 5}. The matrix of r is 

1 ^ 
1110 
i?= 1 

10 1 

1 

Recall that , r-* , . . . can be determined through computing the matrix powers/?^, R^, .... Here , 



p2 = 



(0 





1 





1 > 






/ \ 
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1 
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1 











1 1 1 
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1 1 1 
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1 
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1 1 1 
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1 1 1 
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1 1 1 
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1 1 1 
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,1 


1 








, 






. 1 


1 1 1 


0, 



How do we relate IJ r' to the powers of Rl 
1=1 

Theorem 6.5 J. Let r be a relation on a finite set and let R* be the matrix of r* , the transitive closure of r. Then 
R* = R + R^ + ■■■ + If, using Boolean arithmetic. 

Using this theorem, we find if* is the 5x5 matrix consisting of all 1 ' j, thus, is all of A x A. 
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WARSHALL'S ALGORITHM 

Let r be a relation on the set (1, 2, ... , n] with relation matrix R. The matrix of the transitive closure R* , can be computed by the equation 
i?+ = R + R^ + ■■■ + R".By using ordinary polynomial evaluation methods, you can compute R* with n - I matrix multiplications: 
i?+ = R(I + R(,I +( ■■■R(I + R) •••))). 

For example, if n = 3,R = R(I + R(I + R)). 

We can make use of the fact that if T is a relation matrix, T + T = T due to the fact that 1 + 1 = 1 in Boolean arithmetic. Let 
= R + r2 + ... + Rk .Then 

R = Si 

Si{I + Si) = R{I + R)=R + R^ = S2 

SiU + S2) = iR + R^)U + R + R^) 

= [R + R^) + [R^ + R^) + (R^ + R^) 
= R + R^ + R^ + R'^ =84 

Similarly, 

84(1 + S4) = Ss 
etc.. 

Notice how each matrix multiplication doubles the number of terms that have been added to the sum that you currently have computed. In 
algorithmic form, we can compute R^ as follows. 

Algorithm 6S.1: Transitive Closure Algoritlim 1. Let R be a known relation matrix and let R* be its transitive closure matrix, which is to be 
computed. 

l.O.T:=R 
2.0. Repeat 

2.1 S:= T 

2.2 T:= S(I + S) // using Boolean arithmetic 
Until T = S 

3.0. Terminate with T = R*. 

Notes: 

(a) Often the higher-powered terms in S„ do not contribute anything to R*. When the condition T = S becomes true in Step 2, this is an 
indication that no higher-powered terms are needed. 

(b) To compute R* using this algorithm, you need to perform no more than [logj nl matrix multiplications, where [xl is the least integer that is 

greater than or equal to x. For example, if r is a relation on 25 elements, no more than riog2 25] = 5 matrix multiplications are needed. 

A second algorithm, Warshall's Algorithm, reduces computation time to the time that it takes to perform one matrix multiplication. 

Algoritlim 6S2; Warsliall's Algoritlim. Let i? be a known relation matrix and let if* be its transitive closure matrix, which is to be computed. 
1.0 T:= R 

2.0 FOR k: = Iton DO 

FOR i: = 1 to n DO 
FOR.; := 1 ton DO 

ni,jl:=T[iJl + T[i,kl-T[k,jl 
3.0 Terminate with T = R*. 
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EXERCISES FOR SECTION 6.5 
A Exercises 

1 . Let A and 5 be as in Example 6.5.1. Compute iS* as in Example 6.5.2. Verify your results by checking against the relation obtained in 
Example 6.5.1. 

2. Let A and r be as in Example 6.5.2. Compute the relation r"^ as in Example 6.5.1. Verify your results. 

3. (a) Draw digraphs of the relations S, S^,S^ , and S'^ of Example 6.5.1. 

(b) Verify that in terms of the graph of S,aS'^ bH and only if is reachable from a along a path of any finite nonzero length. 

4. Let r be the relation represented by the digraph in Figure 6.5,1 . 

(a) Find r+ . 

(b) Determine the digraph of r+ directly from the digraph of r. 

(c) Verify your result in part (b) by computing the digraph from your result in part (a). 



5. (a) Define reflexive closure and symmetric closure by imitating the definition of transitive closure. 

(b) Use your definitions to compute the reflexive and symmetric closures of Examples 6.5.1 and 6.5.2. 

(c) What are the transitive reflexive closures of these examples? 

(d) Convince yourself that the reflexive closure of the relation < on the set of positive integers P is <. 

6. What common relations on Z are the transitive closures of the following relations? 

(a) aSb if and only if a + 1 = b. 

(b) a if if and only if \ a - b \ =2. 

B Exercise 

7. (a) Let A be any set and r a relation on A, prove that (_r*)* = r*. 

(b) Is the transitive closure of a symmetric relation always both symmetric and reflexive? Explain. 
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I SUPPLEMENTARY EXERCISES FOR CHAPTER 6 

Section 6.1 

1 . Give an example to illustrate how the relation "is a grandparent of" is a composition of the relation "is a parent of on people. 

2. Three students, Melissa, John, and Ted, would like to set up a tutorial program in the languages Pascal, FORTRAN, and COBOL. Melissa is 
proficient in all three languages, John in Pascal and FORTRAN, and Ted in just FORTRAN. 

(a) Let S = I three students), L = {three Languages), and let p be the relation "is proficient in the language of. Describe this relation as 
a set of ordered pairs and illustrate the relation by a diagram similar to that of Figure 6.1.1. 

(b) Two P.C.s are available for tutoring purposes; one has software for Pascal and FORTRAN, and the second only for Pascal. Describe 
by a composite relation which student can tutor on each machine. Illustrate this composite relationship. 

Section 6.2 

3 . Let A = { - 1 , , 1 , 2 ). List the ordered pairs and draw the digraphs of each of the following relations on A . 

(a) r = {ix,y)\y = x+l] 

(b) s = {ix,y) \x^=y^} 

(c) ? = {(x, >>) I X + >>) 

4. List the ordered pairs and draw the digraph of the relation for the relation s of Exercise 2, Section 6.2. 

5. In Figure 6.2, 1, assume the nodes stand for four separate cities where a manufacturer has warehouses, while the arrows represent one-way 
streets. Where should the manufacturer place his main office? Where is the least desirable location? How can we interpret the arrows in both 
directions between nodes 1 and 2? 

6. Suppose we selected a set of points, on the cartesian plane such that the distances between them are all different. Consider the relation c 
on P defined by Pi cP2 if and only if the closest other point in P to Pi is Pj. For example, with the set |^^, V^j | ^ = 0, 1, 2, 3, 4|, the 
graph of c would look like this, where we locate the vertices in our graph at the actual points in P. 




(a) In general, how many pairs are there in c? 

(b) Look at various possible configurations of c's graph of sets of five points. What other possible configurations can you find? Are there any 
general observations that you can make? 

Section 6.3 

7. In Figure 6.3.2 (vii), interpret the four nodes as representing people, and an arrow from one node to another as "being friendly toward". Note 
that some friendships are not mutual. 

(a) Is there any individual in this group unfriendly to everyone else? 

(b) If this group were a committee, who is most likely to be the chairperson; that is, who is friendly toward the most people? 

(c) If an aiTow from one vertex to itself is interpreted as "great personality," does your answer to part b still hold? 

(d) The four people are to be seated at a round table. A person is to be seated between two people only if he is friendly toward both of 
them. Does a seating arrangement exist? Is there more than one? 

8. Let A = [a, b, c, d, e] and let r, s, and / be the following relations on A: 

r = |(a, a), (a, b), (b, b), (b, c), (c, c), (c, d), (d, d), (d, e), (e, e), (e, a)) 

s = {{a, a), (a, b), (a, d), {b, a), (fo, b), (b, d), (c, c), (d, a), (d, b), (d, d), (e, e)\ 

t = {(a, a), (a, b), (b, b), (c, b), (c, c), (d, d), (e, a), (e, b), (e, c), (e, d), (e, e)] 

(a) Which relation is a partial ordering? Draw its Hasse diagram. 

(b) Which relation is an equivalence relation? List its equivalence classes. 
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9. Demonstrate that the relation "living in the same house" on the set of people in a given city is an equivalence relation. State the necessary 
assumption for this to be the case. 

10. Let A = {00, 01, 10, 11}, the set of strings of Os and Is with length two. Given r and j defined by 

jrr J <=> xmdy differ in exactly one position (for example 01 r 11, but not 10 rOl), and 
xsy ^ X and y have the same number of Os . 

(a) Draw a directed graph of r. 

(b) Which of the adjectives, reflexive, symmetric, antisymmetric, and transitive, describe r^ Explain your answers. 

(c) Which of the adjectives, reflexive, symmetric, antisymmetric, and transitive, describe si 

(d) Describe with a directed graph the relation rs. 

11. Determine whether the following relations are partial orderings and/or equivalence relations on the given set: 

(a) C = {students in this class); xryiiix and y have the same grade point average. 

(b) C = {students in this class); x^yiff xis taller than y, 

(c) Rephrase (slightly) the relation in part b so it is a partial ordering relation. 

12. Let A = [a, b, c, d). Draw the graph of a relation where the relation is: 

(a) reflexive, symmetric, but not transitive. 

(b) transitive, but not symmetric and not reflexive. 

(c) both an equivalence relation and a partial ordering. 

Section 6.4 

13. How many symmetric relations can there be on a four-element set? Hint: Think of the possible relation matrices. 

14. Let A = {1 , 2, 3, 4, 5, 6) and let p = {(i, j) \ i divides j] be a relation on A. 

(a) List the elements in p. 

(b) Determine the relation matrix of p. 

(c) Construct the digraph and the Hasse diagram of p. 

15. Let A = {a, b, c). The following matrices describe relations on A: 





i 1 
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] 




f 1 





1 \ 
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1 





(ii) 
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1 > 




I 1 
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1 , 



(a) Draw the graph of the relation. 

(b) Describe each relation as a set of ordered pairs. 

(c) Compute for each relation r. 

Section 6.5 

16. Let the relation s on the set [a, b, c, d, e] be given by the matrix 
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(a) Draw the digraph of s. 

(b) Find the transitive closure of s. Give the adjacency matrix or the digraph or the set of ordered pairs. 
17. Consider the relation r on {1, 2, 3, 4) whose Boolean matrix is 
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(a) Draw the graph of r. 

(b) Determine whether r is reflexive, symmetric, antisymmetric, and/or transitive. Explain fully. 

(c) Find the transitive closure of r and draw the graph of r* . 

18. In a small town a bank (b), school (j), town hall (t), and shopping mall (m) are cormected by a series of narrow one-way streets; a street 
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from the town hall to the bank, one from the bank to the school, one from the school to the shopping mall, and one from the shopping mall to 
the town hall. 

(a) Draw a digraph of this system of roads. 

(b) Find the matrix representation of the digraph in part a. 

(c) Assuming that the given streets cannot be widened, assist the mayor in planning the construction of new roads to increase traffic 
flow. Assume that if there is a one-way street from point a to b and one from point htoc, there should be one from point a to c. 

(d) If you have not done so yet, draw the matrix representation and the graph of your answer to part c and interpret the results for the 

mayor. 

19. The ambassadors of four countries are to meet with the ambassador of the United States (Ai) to discuss world problems. Some countries are 
friendly to each other, some are not, and in certain situations the friendship is one-way. The U.S. ambassador's daughter (who obviously took a 
discrete structures course) assists her father in diagnosing this complex situation and, using the relation "is friendly toward," has come up with 
the following digraph. 




(a) Is there any person friendly to no one? 

(b) Who should be the chairman of this committee; that is, who is friendly to most people? 

(c) The U.S. ambassador would like to graph all friendships that can be developed through intermediaries on the committee. That is, if 
person a is friendly toward person h and person b is friendly toward person c, then a can communicate to c through h. Draw this digraph. Can 
the U.S. ambassador communicate to every person on the committee through some person(s)? If not, what friendships should he develop to do 

so? 
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FUNCTIONS 



GOALS 

In this chapter we will consider some basic concepts of the relations that are called functions. A large variety of mathematical ideas and 
applications can be more completely understood when expressed through the function concept. 

I 7.1 Definition of a Function and Notation 

Definition: Function. A function from a set A into a set B is a relation from A into B such that each element of A is related to exactly one 
element of the set B. The set A is called the domain of the function and the set B is called the codoniain. 

The reader should note that a function/is a relation from A into B with two important restrictions: 

1 . Each element in the set A, the domain of/, must be related to some element of B, the codomain. 

2. The phrase "is related to exactly one element of the set B" means that if (a, b) e f and (a, c) e /, then b = c. 

Example 7.1.1. Let A = (-2, -1,0, 1, 2) and B = |0, 1, 2, 3, 4|, and if s = |(-2, 4), (-1, 1), (0, 0), (1, 1), (2, 4)|, then ^ is a 
function from A into B. 

Example 7.1.2. Let IR be the real numbers. Then L = 3 x) | x e [R| is a function from R into IR, or, more simply, L is a function on 

R. 

We will use a different system of notation for functions than the one we used for relations. If /is a function from the set A into the set B, we will 
write f : A ^ B. 

The reader is probably more familiar with the notation for describing functions that is used in basic algebra or calculus courses. For example, 
J = - or fix) = - both define the function {(x, -) | x e R, x + 0\. Here the domain was assumed to be those elements of R whose substitu- 
tions for X make sense, the nonzero real numbers, and the codomain was assumed to be R. In most cases, we will make a point of listing the 
domain and codomain in addition to describing what the function does in order to define a function. 

The terms mapping, map, and transformation are also used for functions. 

One way to imagine a function and what it does is to think of it as a machine. The machine could be mechanical, electronic, hydraulic, or 
abstract. Imagine that the machine only accepts certain objects as raw materials or input. The possible raw materials make up the domain. Given 
some input, the machine produces a finished product that depends on the input. The possible finished products that we imagine could come out 
of this process make up the codomain. 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-ShareAlike 3.0 United States License. 



Chapter 7 - Functions 



129 



Example 7.13. f: [R ^ R defined by / (x) = x- is an alternate description of / = x^)\x e R|. 

Definition: Image of an Element. Let f : A -> B, (read "let f be a function from the set A into the set B"). If a e A, then f [a) is used to 
denote that element of B to which a is related, f (a) is called the image of a, or, more precisely, the image of a under f. We write f (a) = b to 
indicate that the image of a is b. 

In Example 7.1.3, the image of 2 under/is 4; that is, / (2) = 4. In Example 7.1.1, the image of -1 underiis l;thatis, ^(-1) = 1. 

Definition: Range of a Function. The range of a function is the set of images of its domain. Iff'X -> Y, then the range of f is denoted 
f(X), and 

f(X) = {f{a)\a eX\ 

= \b eY\ J a eX such that f (a) = b] 

Note that the range of a function is a subset of its codomain. f (X) is also read as "the image of the set X under the function/' or simply "the 
image of/." 

In Example 7.1.1, s (A) = |0, 1, 4|. Notice that 2 and 3 are not images of any element of A. In addition, note that both 1 and 4 are related to 
more than one element of the domain s(\) = s{-l) = 1 and s{2) = s{-2) = 4. This does not violates the definition of a function. Go back 
and read the definition if this isn't clear to you. 

In Example 7.1.2, the range of L is equal to its codomain, IR. If b is any real number, we can demonstrate that it belongs to L(R) by finding a 
real number x for which L(x) = b. By the definition of L, L{x) = 3x, which leads us to the equation x = b. This equation always has a 

solution, j;thusL(R) = R. 

The formula that we used to describe image of a real number under L, L{x) = 3x, is preferred over the set notation for L due to its brevity. 
Any time a function can be described with a rule or formula, we will use this form of description. In Example 7.1 .1 , the image of each element 
of A is its square. To describe that fact, we write ^(a) = (a e A),orS:A B defined by 5 (a) = a^. 
There are many ways that a function can be described. The complexity of the function often dictates its representation. 

Example 7.1.4. Suppose a survey of 1,000 persons is done asking how many hours of television each watches per day. Consider the 
function W: (0, 1, ... , 24) ^ (0, 1, 2, 1000) defined by 

W(t) = number of persons who gave a response of f hours. 

This function will probably have no formula such as the ones for s and L above. Besides listing the data in a table, a bar graph might be a 
good way to represent W. 

Example 7.1.5. Consider the function m : P ^ Q defined by the set 
m = ((1, 1), (2, 1/2), (3, 9), (4, 1/4), (5, 25), ...). 
No simple single formula could describe m, but if we assume that the pattern given continues, we can write 

( x^ if X is odd 
m{x) = \ . 

V 1 /x if X IS even 

FUNCTIONS OF TWO VARIABLES 

If the domain of a function is the Cartesian product of two sets, then our notation and terminology is changed slightly. For example, consider 
the function C : l^sl xf^ -> defined by C((«i, «2)) = n^ + nl- «i ^2 + 10- For this function, we would drop one set of parentheses and write 
C (4, 2) = 22, not C ((4, 2)) = 22. We call C a function of two variables. From one point of view, this function is no different from any others 
that we have seen. The elements of the domain happen to be slightly more complicated. On the other hand, we can look at the individual 
components of the ordered pairs as being separate. If we interpret C as giving us the cost of producing quantities of two products, we can 
imagine varying ni while «2 is fixed, or vice versa. 

Mathematica Note 

There are several ways to define a function in Mathematica. One way is using a Function expression. For example the function / : Z ^ Z, 
where /(x) = x^ can be defined by evaluating the expression 

f = Function [x, x^] 

2 

X I— > X 

Then we can compute the image of various numbers. Since Mathematica isn't a typed programming language the concept of a domain doesn't 
apply unless you specify a bit more structure. As it stands we can take the image of numbers, strings, matrices, or symbols, among other things: 

{f [-5], f [2.5], f ["Hello"], f [(^ A )]' ^^®'} 

|25, 6.25, Hello-, [ ^ j ]. S^] 
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Notice the syntax of Mathematica is to use square brackets such as f [ x ] instead of parentheses, f ( x ) for computing images under/. Notice 
that the square of the matrix isn't the matrix product, rather it is simpler componentwise product. 

If we want to restrict the possible inputs to a function that implements/, we might use the following alternate way of defining a function, 
fa [x_? IntegerQ] : = x^ 

If the "?IntegerQ" had been left out, fa and f would be identical, but as it is defined, the formula for fa only applies to integers. This 
makes fa more similar to/. We get what we expect when we ask for the image of an integer: 

{fa[-5] , fa[25 !]} 

(25 , 240 597 637 008 332 048 087 335 626 345 604 448 256 000 000 000 000) 
If we ask for the image of anything that isn't an integer, the expression is left unevaluated. 

{fa [2. 5], fa ["Hello"], fa[| ^ A )]' ^^^^A 

{fa[2.5], fa[Hello], f a [ { { 1 , 2}, {3, -1}}], fa[e]} 
It is also possible to program Mathematica to issue an error message for these unintended cases. 

Sage Note 

There are several ways to define a function in Sage. The simplest way to implement/is as follows. 



sage: f(x)=x"2 
sage: f 
X I — > x"2 

sage : f (4) 
16 

sage : f (-5.1) 
26.0100000000000 



Sage is built upon the programming language Python, which is a strongly typed language and so you can't evaluate expressions such as 
f ( "Hello" ) . However a function like f above will accept any type of number, so a bit more work is needed to restrict the inputs of f to the 
integers. 

A second way to define a function in Sage is based on Python syntax. 



sage: def fa(x): return x''2 



sage: fa(-4) 
16 

sage: fa(5.1) 
26.0100000000000 



We close this section with two examples of relations that are not functions. 

Example 7.1.6. Let A = fi = |1, 2, 3| and let /= |(1, 2), (2, 3)|. Here / is not a function since / does not act on, or "use," all 
elements of A. 

Example 7.1.7. Let A = S = |1, 2, 3) and let g = |(1, 2), (2, 3), (2, 1), (3, 2)). We note that g acts on all of A. However, g is still 
not a function since (2, 3) e g and (2, 1) e g and the condition on each domain being related to exactly one element of the codomain is 
violated. 

EXERCISES FOR SECTION 7.1 
A Exercises 

1. Let A = |1, 2, 3, 4)andB = {a,b, c, <i). Determine which of the 
following are functions. Explain. 

(a) / c A X B, where / = |(1, a), (2, b), (3, c), (4, d)]. 

(b) gcAxB, where g = ((1, a), (2, a), (3, b), (A,d)]. 

(c) h cAxfi,whereh = ((1, a), (2, b), (3, c)). 

(d) k cAx fi, where = ((1, a), (2, b), (2, c), (3, a), (4, a)]. 
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(e) LcAxA, where L = {(1, 1), (2, 1), (3, 1), (4, 



1)}. 



2. Let A be a set and let S be any subset of A. Let Xs '■ 



A -» {0, 1} be defined by 




1 ifxeS 
iixtS 



The function Xs^ is called the characteristic Junction of S, 

(a) If A = {a, b, c}andS = [a, i}, Ust the element of . 

(b) If A = [a, b, c, d, e}andS = {a, c, e}, , list the element of 

(c) If A = {a, b, c}, what are ;t'0 and 

3 . Find the ranges of each of the relations that are functions in Exercise 1 . 

4. Find the ranges of the following functions on Z: 

(a) g = {{x, 4x +l)\x & Z) 

(b) h(x) = least integer that is greater than or equal to V \ x\ . 

(c) P{x) = X + 10. 

B Exercise 

5. If I A I and | 6 | are both finite, how many different functions are there from A into B? 

6. Let/be a function with domain A and codomain B. Consider the relation K" c A x A defined on the domain of/by (x, y) e K" if and only 

if / (x) = f (y). The relation K is called the kernel off. 

(a) Prove that K is an equivalence relation. 

(b) For the specific case of A = Z, where Z is the set of integers, let / : Z ^ Z be defined by / (x) = x^. Describe the equivalence classes of 
the kernel for this specific function. 
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I 7.2 Injective, Surjective, and Bijective Functions 

Consider the following functions; 

Example 7.2.1. LetA = {1, 2, 3, 4}andB = {a, b, c, d), and defined/ : A ^ Bby 

/(I) = a, /(2) = b, /(3) = cand/(4) = d. 
Example 7.2.2. LetA = {1, 2, 3, 4)andB = {a, b, c, d}, and defined g : A ^ Bby 

= a,gi2) = h, gi3) = a and^?(4) = h. 

The function in the first example gives us more information about the set B than the second function. Since A clearly has four elements /tells us 
that the set B contains at least four elements since each element of the set A is mapped onto one and only one element of the set B, The 
properties that / has and g does not have are the most basic properties that we look for in a function. The following definitions summarize the 
basic vocabulary for function properties. 

Definition Injective Function: A function f: A->Bis injective if 
a, be A, ai=b ^ f{q) ^ fib). 
Notice that the condition for a injective function is equivalent to 

a, h e A, f (a) = f (h) ^ a = h 
Injective functions are also called injections, or one-to-one functions . 

Definition Surjective Function A function f: A -> B is surjective if its range, /(A), is equal to its codomian B 
Notice that the condition for a surjective function is equivalent to 

For all e B, there exists a 6 A such that /(a) = b. 
Surjective functions are also called surjections, or onto functions. 

Definition Bijective Function: A function f: A -fB is bijective if it is both injective and surjective. 
Bijective functions are also called one-to-one, onto functions. 

Example 7.23. The function/of Example 7.2.1 is bijective. The function g of Example 7.2.2 is neither injective nor surjective. 

Example 7.2.4. Let A = {1, 2, 3} and B = {a, b, c, d}, and define / : A ^ B by /(I) = 6, /(2) = c, and /(3) = a. Then/is injective but 

not surjective. 

Example 7.2.5. The characteristic function, xs in Exercise 2 of Section 7.1, is surjective if S is a proper subset of A, but never injective 
if |A| > 2. 

Example 7.2.6. Let A be the set of students who are sitting in a classroom, and let B be the set of seats in the classroom, and let s be the 
function which maps each student into the chair he or she is sitting in. When is s one to one? When is it onto? Under normal circumstances, ^ 
would always be one to one since no two different students would be in the same seat. In order for s to be onto, we need all seats to be used, so 
s is onto if the classroom is filled to capacity. 

Fimctions can also be used for counting the elements in large finite sets or in infinite sets. Let's say we wished to coimt the occupants in an 
auditorium containing 1 ,500 seats. If each seat is occupied, the answer is obvious, namely 1 ,500 people. What we have done is to set up a one- 
to-one correspondence, or bijection, from seats to people. We formalize in a definition. 

Definition: Cardinality. Two sets are said to have the same cardinality if there exists a bijection between them. 

The function/ in Example 7.2.1 serves to show that the two sets A = (1, 2, 3, 4) and B = {a, b, c, d] have the same cardinality. Notice in 
applying the definition of cardinality, we don't actually appear to count either set, we just match up the elements. However, matching the letters 
in B with the number 1, 2, 3, an 4. is precisely how we count the letters. 

Definition: Countable Set. If a set is finite or has the same cardinality as the set of positive integers, it is called a countable set. 
Ifasetisfinite, anditscardinaUtyisn,thenithasthesamecardinaUty astheset{l, 2, 3, n). 

Example 7.2.7. The alphabet {A, B, C, ZJhascardinaUty 26 through the following bijectioninto the set {1, 2, 3, 26} 
ABC" Z 

J, J, J, J, 

1 2 3 ••• 26 
Reminder: 2P= {fceP | b= 2feforsomefe e P} 

Example 7.2.8. The set 2P of even positive integers has the same cardinality as the set P of positive integers. To prove this, we must 
find a bijection from P to 2 P. Such a function isn't unique, but this one is the simplest: / : P -» 2 P by /(m) = 2m. Two statements must 
be proven to justify our claim that/is a bijection. 
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(1) /is one-to-one. 

Proof: Let a, b e P and assume that /(a) = /(fc). We must prove that a = b. 
fid) = fib) =^ 2a = 2b ^ a = b. ■ 

(2) /is onto. 

Proof: Let b e 2P. We want to show that there exists an element a e P such that / (a) = b. If b e 2P,b = 2 ^ for some ^ e P by the 
definition of 2 P. So we have f(k) = 2k = b. Hence, each element of 2P is the image of some element of P. ■ 

An even easier way to look at any function with P as its domain by creating a Ust of the form /(I), /(2), /(3), .... which is 2,4,6,... for 
the function we've just defined. This infinite Ust clearly has no dupUcate entries and every even positive integer appears in the list eventually. 

Bijections with domain IP; A function f :¥ A is a bijection if the infinite list f{l), f(2), f{3), . . . contains no duplicates, and every 

element of A appears on in the list. 

Readers who have studied real analysis should realize that the set of rational numbers is a countable set, while the set of real numbers is not a 
countable set. See the exercises at the end of this section for an example of such a set. 

We close this section with an theorem called the Pigeonhole Principle, which has numerous applications even though it is an obvious, common- 
sense statement. Never underestimate the importance of simple ideas. The Pigeonhole Principle states that if there are more pigeons than 
pigeonholes, then two or more pigeons must share the same pigeonhole. A more rigorous mathematical statement of the principle follows. 

The Pigeonhole Principle. Let f be a function from a finite set X onto a finite set Y. Ifnzl and \X\ >n\Y\, then there exists an 
element of Y that is the image of at least n + 1 elements ofX. 

Example 7.2.9. Assume that a room contains four students with the first names John, James, and Mary. Prove that two students have the 
same first name. We can visualize a mapping from the set of students to the set of first names; each student has a first name. The pigeonhole 
principle applies with n = I, and we can conclude that two of the students have the same first name. 

EXERCISES FOR SECTION 7.2 
A Exercises 

1. Determine which of the functions in Exercise 1 of Section 7.1 are one- to-one and which are onto. 

2. (a) Determine all bijections from the {1, 2, 3} into {a, b, c). 
(b) Determine all bijections from {1, 2, 3} into {a, b, c, d}. 

3. Which of the following are one-to-one, onto, or both? 

(a) /i : R ^ R defined by /, W = - x. 

(b) /2 :Z ^ Z defined by /2 W = -x + 2. 

(c) /, : FSj xl^ ^ 1^ defined by /(/ k) = 2-'' 3*. 

(d) /4 :P P defined by /4 (n) = rn/21, where [xl is the ceiling of x, the smallest integer greater than or equal to ;<:. 

(e) f^ -.^ ^ N defined by / (n) =n^ + n. 

(f) : ^ 1^ X 1^ defined by f(, (n) = (2 «, 2n+ 1). 

4. Which of the following are injections, surjections, or bijections on R, the set of real numbers? 

(a) f(x) = -2x. 

(b) g(x) = x^- I. 

IX X < 

(C) h{x) = { 2 

^ x-^ X > 

(d) q(x) = 2' 

(e) r(x) = x^ 

(f) six) = x^ - X. 

5. Suppose that m pairs of socks are mixed up in your sock drawer. Use the Pigeonhole Principle to explain why, if you pick m + 1 socks at 
random, at least two will make up a matching pair, 

6. In your own words explain the statement "The sets of integers and even integers have the same cardinality." 

7. Let A = (1, 2, 3, 4, 5). Find functions, if they exist that have the properties specified below 
(a) A function that is one-to-one and onto. 
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(b) A function that is neither one-to-one nor onto 

(c) A function that is one-to-one but not onto. 

(d) A function that is onto but not one-to-one. 

8. (a) Define functions, if they exist, on the positive integers, P, with the same properties as in Exercise 7 (if possible). 

(b) Let A and B be finite sets where \A \ = \B \. Is it possible to define a function f : A ^ B that is one-to-one but not onto? Is it possible 
to find a fimction g : A -» B that is onto but not one-to-one? 

B Exercises 

9. (a) Prove that the set of natural numbers is countable. 

(b) Prove that the set of integers is countable. 

(c) Prove that the set of rational numbers is countable. 

10. (a) Prove that the set of finite strings of O's and I's is countable. 

(b) Prove that the set of odd integers is countable. 

(c) Prove that the set x is countable. 

1 1 . Use the Pigeonhole Principle to prove that an injection cannot exist between a finite set A and a finite set B if the cardinality of A is greater 
than the cardinality of B. 

12. The important properties of relations are not generally of interest for functions. Most functions are not reflexive, symmetric, antisymmet- 
ric, or transitive. Can you give examples of functions that do have these properties? 

13. Prove that the set of all infinite sequences of O's and I's is not a countable set (i. e., that it is an uncountable set). 

14. Prove that the set of all functions on the integers is an uncountable set. 
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I 7.3 Composition, Identity, and Inverse 

Now that we have a good understanding of what a function is, our next step is to consider an important operation on functions. Our purpose is 
not to develop the algebra of functions as completely as we did for the algebras of logic, matrices, and sets, but the reader should be aware of 
the similarities between the algebra of functions and that of matrices. We first define equality of functions. 

Definition: Equality of Functions. Let f,g:A-> B; that is, let f and g both be functions from A into B. Then fis equal to g (i. e., 

f = S) ifond only iff (x) = g (x)for all x e A. 

Two functions that have different domains caimot be equal. For example, f : J.^Z. defined by f{x) = x^ and g : R ^ R defined by gix) = x^ 
are not equal even though the formula that defines them is the same. 

On the other hand, it is not uncommon for two functions to be equal even though they are defined differently. For example consider the 
functions h and k, where h: {-1,0, 1,2}^{0, 1,2} is defined by h(x)= \x\ and k: {-1,0, 1,2}^{0, 1,2} is defined by 

k(_x) = -j-+x^+ J appear to be very different functions. However, they are equal because = k(x) forx = -1, 0, 1, and2. 
COMPOSITION 

One of the most important operations on functions is that of composition. 

Definition: Composition of Functions. Let f : A ^ B and g : B -> C. Then the composition off followed by g, written g °f is a function 

from A into C defined by (g °f) (x) = g(f (x)), which is read " g off of x." 

The reader should note that it is traditional to write the composition of functions from right to left. Thus, in the above definition, the first 
function performed in computing g of, which is/. On the other hand, for relations, the composition rs is read from left to right, so that the first 
relation is r. 

Example 73.1. 

(a) Let /: {1, 2, 3) ^ [a, h] be defined by / (I) = a, f(2) = a, and /(3) = b. Let g : {a, b] ^ |5, 6, 7) be defined by g(a) =5 and 
gib) = 7. Then go/ ; {1, 2, 3) ^ {5, 6, 7} is defined by (go/)(l) = 5, (g°/)(2) = 5, and (g°/)(3) = 7. For example, 
(go/)(i) = g (/(/)) = g(a) = 5. Note that fog is not defined. Why? 

(b) Let / : R -> R be defined by / (x) = x^ and let g : R -> R be defined by g (x) = 3 x + 1 . Then, since 

(g°/)W = ^(/W) = gi^') = 3x3 + 1, 

we have g°f : R -> R is defined by (g o /) (x) = 3 x' + 1 . Here / = g is also defined and / o g : R ^ R is defined by ( / o g) (x) = (3 x + 1)^ . 
Moreover, since 3 X + 1 (3 x + 1)-' for at least one real number, go/ :/= /°g. Therefore, the commutative law is not true for functions 
under the operation of composition. However, the associative law is true for functions under the operation of composition. 

Theorem 73.1. Function composition is associative. That is, iff : A B,g : B C, and h :C D, then h °{g °f) = (h °g) °/. 

Proof Technique: In order to prove that two functions are equal, we must use the definition of equality of functions. Assuming that the 
functions have the same domain, they are equal if, for each domain element, the images of that element under the two fimctions are equal. 

Proof; We wish to prove that {ho{gof)) (x) = ((h°g)°f) (x) for all x e A, which is the domain of both functions. 

(ho(gof))(x) = h((g°f)(x)) by the definition of composition of /; with go/ 
= h{g(f(x))) by the definition of composition of g with/ 

Similarly, 

iih°g)°f)ix) = {h°g)(f{x)) by the definition of composition of /!°g with/ 
= h(g(f(x))) by the definition of composition of h withg 

Notice that no matter how the functions the expression h°g°f is grouped, the final image of any element of xeA is h(g(f(x))) and so 
h°(g°f) = (h°g)°f. m 

If /is a function from a set A onto itself, we can find f° f, f° f° /, ... ,which we write as f^ , f-^ , .... This idea can be expressed more 
elegantly as follows; If / : A -> A, the repeated composition of/ with itself is defined recursively as; 

Definition: Powers of Functions. Let f : A-fA. 

(1) /' = /,■ that is, f\a) = f(a),for a e A. 

(2) Forn>l,r^^ = f''f"; thatis, /"+' (a) = /(/"(a)) anda eA. 

Two useful theorems concerning composition are given below. The proofs are left for the exercises. 

Theorem 732. If f : A ^ B ar\d g : B C are injections, then g°f : A C is an injection. 

Theorem 733. Iff .• A -» B and g ;B -^C are surjections, then g°f: A -» C isa surjection. 

We would now like to define the concepts of identity and inverse for functions under composition. The motivation and descriptions of the 
definitions of these terms come from the definitions of the terms in the set of real numbers and for matrices. For real numbers, the numbers 
and 1 play the unique role that x + = + x = x and x • 1 = 1 • x = x for any real number x. and 1 are the identity elements for the reals 
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under the operations of addition and multiplication, respectively. Similarly, the n x n zero matrix and the n x n identity matrix / are such that 
for any n x n matrix A,A+0 = + A = A and AI = I A = I. Hence, an elegant way of defining the identity function under the operation of 
composition would be to imitate the above well-known facts. 

IDENTITY FUNCTION 

Definition; Identity Function. For any set A, the identity function on A is a function from A onto A, denoted by i (or, more specifically, 
Ia) such that i(a) = a for alia e A 

Based on the definition of i, we can show that for all functions / : A -> A, f oi = iof = f. 

An alternate description is that the identity function on A is the function j*(a) = a for all a e A. This can be proven from the above definition. 

Example 7.3.2. If A = (1, 2, 3 ), then the identity function r : A ^ A is defined by /(I) = 1, /(2) = 2,and/(3) = 3. 

Example 7.3.3. The identity function on IR is ; : K ^ R defined by / (x) = x. 
INVERSE FUNCTIONS 

We will introduce the inverse of a function with a special case: the inverse of a function on a set. After you've taken the time to imderstand this 
concept, you can read about the inverse of a function from one set into another. The reader is encouraged to reread the definition of the inverse 
of a matrix in Section 5.4 to see that the following definition of the inverse function is a direct analogue of that definition. 

Definition: Inverse Function. Let f : A -> A. If there exists a function g .• A -» A such that g°f = f^g = i, then g is called the 

inverse of f and is denoted by , read "/' inverse . " 

Notice that in the definition we refer to "the inverse" as opposed to "an inverse." It can be proven that a function can never have more than one 
inverse (see exercises). 

An alternate description of the inverse of a function, which can be proven from the definition, is as follows: 

Let / : A ^ A be such that / (a) = b. Then when it exists, /"Ms a function from A to A such that f~^(b) = a. Note that "undoes" what/ 
does. 

Example 7.3.4. Let A = |1, 2, 3) and let / be the function defined on A such that /(I) = 2, /(2) = 3, and /(3) = 1. Then 
/-I : A ^ Ais defined by /"'(/) = 3,/-'(2) = l,and/-'(3) = 2. 

Example 73.5. If g : R -> R is defined by g(x) = x^ , then g~^ is the function that undoes what g does. Since g cubes real nimibers, g ' 

must be the "reverse" process, namely, takes cube roots. Therefore, g ' : R ^ R is defined by g~^(x) = '\fx'. We should show that 
°g = i and gog"' = We will do the first, and the reader is encouraged to do the second. 

' ° g) (x) = g" ' (g(x)) Definition of composition 
= g~^(_x^) Definition of g 

= -\ll^ Definition of g~' 

= X Definition of cube root 

= iix) Definition of the identity function 

Therefore, g~Mg = why? 

The definition of the inverse of a function alludes to the fact that not all functions have inverses. How do we determine when the inverse of a 
function exists? 

Theorem 73.4. Let f : A^ A. f^' exists if and only if fis a bijection; i. e.fis one-to-one and onto. 

Proof: (=>) In this half of the proof, assume that /"^ exists and we must prove that/is one-to-one and onto. To do so, it is convenient 
for us to use the relation notation, where f{s) = f is equivalent to (s, t) e /. To prove that / is one-to-one, assume that /(a) = f{b) = c. 
Alternatively, that means (a, c)and(fc, c) are elements of / . We must show that a =b. Since (a, b), (c, b) e /, (c, a) and (c, fc) are in 
/"^ . By the fact that /"' is a function and c cannot have two images, a and b must be equal, so /is one-to-one. 

Next, to prove that/is onto, observe that for / ' to be a function, it must use all of its domain, namely A. Let b be any element of A. 
Then b has an image under f'^ , f~^(,b). Another way of writing this is {b, f~^(,b)) e /"^ , By the definition of the inverse, this is equivalent to 

(/"'(fc), b) e /. Hence, b is in the range of/. Since b was chosen arbitrarily, this shows that the range of/must be all of A. 

(<^ ) Assume/is one-to-one and onto and we are to prove /"' exists. We leave this half of the proof to the reader. ■ 
Definition; Permutation. A bijection of a set A into itself is called a permutation of A, 

Next, we will consider the situation where f: A-»B and B is not necessarily equal to A. How do we define the inverse in this case? 

Definition: Inverse of a Function (General Case). Let f : A B, If there exists a function g : B ^ A such that g °f = u and 

f ° a = 's . th^n g is called the inverse of f and is denoted by /"' , read "/inverse." 

Note the slightly more complicated condition for the inverse in this case because the domains of /° g and g =/ are different if A and B are 
different. The proof of the following theorem isn't really very different from the special case where A = B. 

Theorem 73 J. Letf .• A -> B. f'' exists if and only iff is a bijection. 



Applied Discrete Structures by Alan Doerr & Keimeth Levasseur is licensed under a Creative Commons Attribution-Nonconmiercial-ShareAlike 3.0 United States License. 



Chapter 7 - Functions 



137 



Example 7.3.6. Let A ={1,2, 3) and B = {a, b, c]. Define / : A ^ B by /(I) = a, f (2) = b, and /(3) = c. Then g: B ^ A 
defined hy g(a) = 1 , g(b) = 2, and g (c) = 3 is the inverse of/. 

(g°/)(l)= 1 (/og)(a) = a 

(go/)(2) = 2 }^ go/ = and {fog)(b) = b]^ fog = ig 

(go/) (3) = 3 (/og)(c) = c 

'^1^ Mathematica Note 

Computer languages have many functions that also have inverses. Here are a few examples in Mathematica: 
RotateRigh-t and RotateLeft are both functions on lists. For example, 
RotateRight [ { 1 , 2, 3, 4, 5}] 

(5, 1,2, 3,4) 

RotateLeft[{l, 2, 3, 4, 5}] 

(2, 3,4, 5, 1) 

RotateLeft [RotateRight [{1, 2, 3, 4, 5}]] 

(1, 2, 3, 4, 5) 

IntegerDigits and FromDigits are inverses of one another, with domains being the positive integers and lists of digits, respectively. 
For example 

IntegerDigits [1492] 

(1, 4, 9, 21 

FromDigits [ {1, 4, 9, 2}] 

1492 

FromDigits [IntegerDigits [193 410] ] 

193410 

IntegerDigits [FromDigits [ {4, 3, 1, 7, 8}]] 

(4, 3, 1, 7, 8) 

EXERCISES FOR SECTION 7.3 
A Exercises 

1. Let A = (1, 2, 3, 4, 5), B = {a, b, c, d, e, /), and C = {+, -). Define f : A ^ Bhy f{k) equal to the /t* letter in the alphabet, and 
define g : B ^ C by g(Q:) = + if a- is a vowel and g(a) = - if a- is a consonant. 

(a) Find go/. 

(b) Does it make sense to discuss /og? If not, why not? 

(c) Does /-' exist? Why? 

(d) Does g-' exist? Why? 

2. Let A = |1, 2, 3).Define/: A^Aby /(I) = 2, /(2) = l,and /(3) = 3. Find /^ ,p ,fsmdf-\ 

3. Let A = |1, 2, 3). 

(a) List all permutations of A. 

(b) Find the inverse of each of the permutations of part a. 

(c) Find the square of each of the permutations of part a. 

(d) Show that the composition of any two permutations of A is a permutation of A. 

(e) Prove that if A be any set where the | A | = n, then the number of permutations of A is n !. 

4. Define s, u, and d, all functions on the integers, by s{n) = rp- ,u(n) = n + 1 , and d{n) = n - \ . Determine: 
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(a) u ° s ° d 

(b) s ° u° d 

(c) d ° s ° u 

5. Based on the definition of the identity function, show that for all functions / : A ^ A, /o / = /o/ = /. 

6. Inverse images. If/is any function from A into B, we can describe the inverse image of from B into PiA), which is also commonly denoted 
/"'.Iffe e B, f~^{b) = {a eA I /(a) = fo). If /does have an inverse, the inverse image of 6 is {/"H^)}- 

(a) Letg : R -» R be defined by = What are g-H4),g-H0) andg-H-1)? 

(b) Ifr: R^Z, where r{x) = fx}, whatisr-^l)? 

7. Let/, g, and ft aU be functions from Z into Z defined by/ (n) = n + 5,g(_n) = n - 2, and /!(n) = n^. Define: 

(a) /°g 
(b) 

(c) /o/, 

8. Define the following functions on the integers by / (fc) = k + l,g(k) = 2^, and h(k) = \k/2] 

(a) Which of these functions are one-to-one? 

(b) Which of these functions are onto? 

(c) Express in simplest terms the compositions f°g,s°f,8°h, h ° g, and , 

B Exercises 

9. State and prove a theorem on inverse functions analogous to Theorem 5.4.1 (if a matrix has an inverse, that inverse is unique). 

10. Let/and g be functions whose inverses exist. Prove that (f°g)~^ = °f'^ . (Hint: See Exercise 3 of Section 5.4.) 

11. Prove Theorems 7.3.2 and 7.3.3. 

1 2 . Prove the second half of Theorem 7 .3 .4 . 

13. Prove by induction that if n> 2 and fi, fi . ••• . /« are invertible functions on some nonempty set A, then 
(/i°/2° ••• °/n)~' = /r^°""°/2~'°/i~'- The basis has been taken care of in Exercise 10. 

C Exercises 

14. (a) Our definition of cardinality states that two sets, A and B, have the same cardinality if there exists a bijection between the two sets. 
Why does it not matter whether the bijection is from A into B or B into A? (b) Prove that "has the same cardinality as" is an equivalence relation 

on sets. 

15. Construct atable listing as many "Laws of Function Composition" as you can identify. Use previous lists of laws as a guide. 
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SUPPLEMENTARY EXERCISES FOR CHAPTER 7 

Section 7.1 

1. If/:Z^Zisdefmedby/(a) = 2|a| + l, 

(a) What is the domain of/? 

(b) What is the codomain of/? 

(c) What is the image of -5 under/? 

(d) What is the range of/? 

(e) Given that f(a) =11, can you tell exactly what a must be? 

2. Let / : P -> P, where/(a) is the largest power of two that evenly divides a; for example, /(12) = 4, /(9) = 1, and/(8) = 8. Describe the 
equivalence classes of the kernel off. 

3. Are any of the relations given in Figure 6.3.2 functions on the set A of nodes? Explain. 

4. Let ?7 be a set with subsets A and B. 

(a) Show that g : ?7 -> {0, 1} defined by g{a) = mm(CA(a), Caia)) is the characteristic function of A OB. (See Exercise 2 of Section 7.1 .) 

(b) What characteristic function is h : U -> {0, 1} defined by h(a) = max(CA(a), Cfl(a))? 

(c) How are the characteristic functions of A and A'^ related? 
Section 7.2 

5. Recall that every function is a relation. Suppose that/is a function on {1, 2, n], where n is a positive integer. 

(a) What must the matrix of/ look like? In other words, what distinguishes the matrix of/ from the matrix of the typical relation? How many 
1 s appear in the matrix? 

(b) If /is a bijection, what further property does the matrix of /have? (Hint for those who know chess: We wiU call the matrix of a bijection a 
"rook matrix.") 

6. LetS = {a, c}andletr = {l,2, 3, 4} 

(a) Give an example of a relation fi-om S to r that is not a function. 

(b) Give an example of a relation from S to T that is an injection. 

(c) How many injections are there from S to r? 

(d) How many suijections are there from S to r? 

7. Prove that the following sets are countable. 

(a) {n^inel^} 

(b) {; :neP) 

(c) {3, 9,27, 81, ...}U{2, 4, 8, 16, ...} 

8 . Prove that if A and B are countable sets , then their union is also countable . 

9. Prove that if A and B are any two sets, then |(AxB)| = |(BxA)|; that is, prove that AxB and Bx A have the same cardinaUty. 

10. Let A be a finite set. 

(a) Is every injection / : A -» A a surjection? Explain. 

(b) Is every surjection / : A ^ A an injection? Explain. 

(c) Are parts a and b true if A is an infinite set? Give a counterexample. 

1 1 . Two children playing "spy" have devised a code that consists of spelling each word backwards and replacing the letters a and f by m and q, 
respectively. Let A be the set of letters in the EngUsh alphabet. Assume the boys use characters only from A. Explain this code using the 
concept of functions. Explain why this code wiU or wiU not work, using the concept of functions. 

Section 7.3 
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12. Consider the functions /, g : R -> R defined by fix) = Sx + 5 and g(x) = x^. 

(a) Show that/is injective. 

(b) Show that/is surjective. 

(c) ¥mdf-\x). 

(d) Findg°/W. 

(e) Find/ogW. 

13. If/,g, h:!H -» , where /(a) = 10a, g(a) = a+ 10, and /!(a) = a div 10 (fte^Morienn'n a -=- 10), calculate 

(a) (fog) (a). 

(b) gHa) = (g °g) (a). 

(c) (/!=./) (a). 

(d) (hog) (a). 

14. If /, M, : Fsl -> f^, where/(a) = 2a, M(a) = a + 1, andd(a) = max(0, a - 1), calculate 

(a) (/°M)(a). 

(b) /'(a). 

(c) (rfo(/o,<)) (a). 

(d) (d°u) (a) = a; therefore, what is d°u! 

(e) Explain why d is no? the inverse of u. 

15. Let/be a function on A = [a, h, c, d} such that /(a) = c, and /(of) = b. 

(a) What are f(b) and /(c) if / = /? 

(b) What are /(i>) and /(c) if /o/ = j? 

16. Let A be a nonempty set. Prove that if/is a bijection on A and f°f = /, then/is the identity function, ;'. Hint: You have seen a similar 
proof in matrix algebra. 



led/' 

A bijection from a set to itself is also referred to as a permutation of the set. Let ;r be a permutation of {a, fo, c, d} such that a becomes n{a), b 

becomes nib), etc. 



18. (For those who have had calculus): Let V be the set of all functions that are defined and have infinitely many derivatives over some fixed 
interval of the real line. Let /eV and define D :V ^Vhy D{f) = f , that is, D is the act of taking the derivative. Hence: 

Z)(x2 + 3x + 5) = 2jc + 3. 

(a) Is £) a function? Explain. 

(b) Is £) a bijection? Explain. 

(c) What is the interpretation of = D oD, D^ . . . , D"? 

(d) Does D"' exist? If not, is there a function in calculus that is "close to" ? What is it? 

19. The exponential function / : R ^ R* defined by f(x) = b" has as its inverse the logarithmic function g(x) = log;, x. 

(a) What are the domain and codomain of g? 

(b) Prove that g is the inverse of / 

20. In high school, many of us saw the notation sin"' x. Why is this notation used? 




Letfi = 



7i(a) n(b) 
n(c) n(d) 



) 



. How many permutations of ;7r leave the determinant of A invariant, that is, det A = detB? 
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RECURSION AND RECURRENCE RELATIONS 



GOALS 

An essential tool that anyone interested in computer science must master is how to think recursively. The ability to understand definitions, 
concepts, algorithms, etc., that are presented recursively and the ability to put thoughts into a recursive framework are essential in computer 
science. One of our goals in this chapter is to help the reader become more comfortable with recursion in its commonly encountered forms. 

A second goal is to discuss recurrence relations. We will concentrate on methods of solving recurrence relations, including an introduction to 
generating functions. 

I 8.1 The Many Faces of Recursion 

Consider the following definitions, all of which should be somewhat familiar to you. When reading them, concentrate on how they are similar. 

Example 8.1.1. A very common alternate notation for the binomial coefficient ^ ^ j is C(n; k). We will use the latter notation in this chapter. 
Here is a recursive definition of binomial coefficients. 



Definition: Binomial Coefficients. Assume n >0 and n > k > 0. 

C{n:0) = 1 
C(n, n) = I 

and Cin: k) = C(n - \; k) + C{n - I; k - 1) ifn>k> 0. 



POLYNOMIALS AND THEIR EVALUATION 

Definition: Polynomial Expression in x over S (Non-Recursive). Let n be an integer, n >0. An n"" degree polynomial in x is an 

expression of the form a„x" + a„_ix"^' + ••• + ajX+ag, where a„, a„_j aj,ag are elements of some designated set of numbers, S, 

called the set of coefficients and a„ ^ 0. 

We refer to x as a variable here, although the more precise term for x is an indeterminate. There is a distinction between the terms 
indeterminate and variable, but that distinction will not come into play in our discussions. 

Zeroth degree polynomials are called constant polynomials and are simply elements of the set of coefficients. 

This definition is often introduced in algebra courses to describe expressions such as /(n) = 4n^ + 2n^ - 8« + 9, a third-degree, or cubic. 
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polynomial in n. This definitions has has drawbacks when the variable is given a value and the expression must be evaluated, For example, 
suppose that n = 7. Your first impulse is likely to do this: 

/(7) = 4x73 + 2x7^-8x7 + 9 

= 4x343 +2x 49 - 8x7 + 9= 1423 

A count of the number of operations performed shows that five multiplications and three additions/subtractions were performed. The first two 
multiplications compute 7- and 7', and the last three mutiply the powers of 7 times the coefficients. This gives you the four terms; and adding/- 
subtacting a list of k numbers requires ^ - 1 addition/subtractions. The following definition of a polynomial expression suggests another more 
efficient method of evaluation. 

Definition: Polynomial Expression in x overS (Recursive). Let She a set of coejficients and x any variable. 

(a) A zeroth degree polynomial expression in x over S is a nonzero element of S. 

(b) For n > 1, an n"' degree polynomial expression in x over S is an expression of the form p{x) x + a where p (x) is an (n - 1)"' degree 
polynomial expression in x and a e S. 

We can easily verify that f(n) is a third-degree polynomial expression in n over the Z based on this definition: 

f(n)= (4n^+ 2n-H)n + 9= ((4 n + 2) « - 8) « + 9 

Notice that 4 is a zeroth degree polynomial since it is an integer. Therefore 4n + 2 is a first-degree polynomial; therefore, (4 n + 2) n - 8 is a 
second-degree polynomial in n over Z; therefore, f (n) is a third-degree polynomial in n over Z. The final expression for f(n) is called its 
telescoping form. If we use it to calculate /(7), we need only three multiplications and three additions /subtractions. This is called Horner's 
method for evaluating a polynomial expression. 

Example 8.12. (a) The telescoping form of p{x) = + 12 x^ -6x^ + x + 6is (((5x + 12) x - 6)x + l)x + 6. Using Horner's 
method, computing the value of p(c) requires four multiplications and four additions/subtractions for any real number c. 

(b) g{x) = -x^ + 3x'^ + 2x^ + X has the telescoping form ((((- X + 3)x)x + 2)x + l)x. 

Many computer languages represent polynomials as lists of coefficients, usually starting with the constant term. For example, 

g(x) = -x' + 3x'' + 2x^ + X would be represented with the list |0, 1,2,0,3,-1). In both Mai/iemaiica and Sage, polynomial expressions 
can be entered and manipulated, so the list representation is only internal. Some lower-leveled languages do require users to program polyno- 
mial operations with lists. We will leave these programming issues to another source. 

Example 8.1.3. A recursive algorithm for a binary search of a sorted list of items: r = |r(l), r(2) ... , r{n)} represent a list of n items 

sorted by a numeric key in descending order. The 7* item is denoted r(j) and its key value by r{j).key. For example, each item might 
contain data on the buildings in a city and the key value might be the height of the building. Then r(l) would be the item for the tallest 
building. The algorithm BinarySearch (7, k) can be applied to search for an item in r with key value C. This would be accomplished by the 
execution of BinarySearch (1, n). When the algorithm is completed, the variable Found will have a value of true if an item with the desired 
key value was found, and the value of location will be the index of an item whose key is C. If Found stays false, no such item exists in 
the list. The general idea behind the algorithm is illustrated in Figure 8.1 .2. 

Hiaiitr Kcy\ Lower fci!y> 















— 1 




* * m 








« 4 ■ 





FIGURE 8.1.2 Illustration of BinarySearch 



In this algorithm. Found and location are "global" variables to execution of the algorithm. 

BinarySearch (j, k) : 
Found = False 
ltJ<K 
Then 

Mid = L(j+k)/2J 
If r(Mid).key C 
Then 

location — Mid 
Found = TRUE 
Else 

If /•(Mid).key < C 

Then execute BinarySearch (j, Mid - 1) 
Eise execute BinarySearch(Mid + 1 , A:) 

For the next two examples, consider a sequence of numbers to be a list of numbers consisting of a zeroth number, first number, second number, 
... . If a sequence is given the name S, the k'^ number of 5, is usually written S^. or S{k). 
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Example 8.1.4 Define the sequence of numbers B by 
Bo = 100 and 
Bj. = 1.08 B^_, for/t> 1 

These rules stipulate that each number in the list is 1 .08 times the previous number, with the starting number equal to 100. For example 

B3 = I.O8B2 

= 1.08(1. 08 Bi) 

= 1.08 (1.08 (1.08 Bo)) 

= 1.08(1.08(1.08 100)) 

= 1.08^ 100 

= 125.971 

Example 8.1.5. The Fibonacci sequence is the sequence F defined by 
Fo = 1, Fi = 1 and 
Fk = Fk-2 + F^-ifor k>2. 

RECURSION 

All of the previous examples were presented recursively. That is, every "object" is described in one of two forms. One form is by a simple 
definition, which is usually called the basis for the recursion. The second form is by a recursive description in which objects are described in 
terms of themselves, with the following qualification. What is essential for a proper use of recursion is that the objects can be expressed in 
terms of simpler objects, where "simpler" means closer to the basis of the recursion. To avoid what might be considered a circular definition, 
the basis must be reached after a finite number of applications of the recursion. 

To determine, for example, the fourth item in the Fibonacci sequence we repeatedly apply the recursive rule for F until we are left with an 
expression involving Fq and Fi : 

Fa= Fi + F3 

= (Fo + Fi) + (F, + F2) 
= (Fo + Fi) + (Fi +(Fo + Fi)) 
= (1 + 1) + (1 + (1 + 1)) 
= 5 

ITERATION 

On the other hand, we could compute a term in the Fibonacci sequence, say F5 by starting with the basis terms and working forward as follows: 

F, = Fq + F, = 1 + 1 = 2 
F3 = Fi + F2 = 1 + 2 = 3 
F4 = F2 + F3 = 2 + 3 = 5 

F5 = F3 + F4 = 3 + 5 = 8 

This is called an iterative computation of the Fibonacci sequence. Here we start with the basis and work our way forward to a less simple 
number, such as. Try to compute F5 using the recursive definition for F as we did for F4 . It wiU take much more time than it would have taken 
to do the computations above. Iterative computations usually tend to be faster than computations that apply recursion. Therefore, one useful 
skill is being able to convert a recursive formula into a nonrecursive formula, such as one that requires only iteration or a faster method, if 
possible. 

An iterative formula for C(n; k) is also much more efficient than an application of the recursive definition. The recursive definition is not 
without its merits, however. First, the recursive equation is often useful in manipulating algebraic expressions involving binomial coefficients. 
Second, it gives us an insight into the combinatoric interpretation of C(n; k). In choosing k elements from {1, 2, n}, there are 
C(n - 1; fe) ways of choosing all k from {1,2, . . . , n - 1}, and there are C(n - 1; ^ - 1) ways of choosing the k elements if n is to be 
selected and the remaining k - \ elements come from {1, 2, n - 1}. Note how we used the Law of Addition from Chapter 2 in our 
reasoning. 

BinarySearch Revisited. In the binary search algorithm, the place where recursion is used is easy to pick out. When an item is examined 
and the key is not the one you want, the search is cut down to a sublist of no more than half the number of items that you were searching in 
before. Obviously, this is a simpler search. The basis is hidden in the algorithm. The two cases that complete the search can be thought of as 
the basis. Either you find an item that you want, or the sublist that you have been left to search in is empty (j > k). 

BinarySearch can be translated without much difficulty into any language that allows recursive calls to its subprograms. The advantage to such 
a program is that its coding would be much shorter than a nonrecursive program that does a binary search. However, in most cases the recursive 
version will be slower and require more memory at execution time. 

INDUCTION AND RECURSION 

The definition of the positive integers in terms of Peano's Postulates (Section 3.7) is a recursive definition. The basis element is the number 1 
and the recursion is that if n is a positive integer, then so is its successor. In this case, n is the simple object and the recursion is of a forward 
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type. Of course, the validity of an induction proof is based on our acceptance of this definition. Therefore, the appearance of induction proofs 
when recursion is used is no coincidence. 

Example 8.1.6. A formula for the sequence B in Example 8.1.4 is B = 100 (1.08)* for k>O.A proof by induction follows: If A: = 0, then 
B = 100(1.08)° = 100, as defined. Now assume that for some k>l, the formula for is true. 

B|j+j = 1 .08 Bj. by the recursive definition 

= 1 .08 ( 100 ( 1 .08)*) by the induction hypothesis 

= 100 (1 .08)*+^ hence the formula is true for A: + 1 

The formula that we have just proven for B is called a closed form expression. It involves no recursion or summation signs. 

Definition: Closed Form Expression. Let E = E(xj, X2, ■ ■ ., x„) he an algebraic expression involving variables Xj, X2, ...yXn which are 
allowed to take on values from some predetermined set. E is a closed form expression if there exists a number B such that the evaluation of 
E with any allowed values of the variables will take no more than B operations (alternatively, B time units). 

n 

Example 8.1.7. The sum E{n) = 2 ^ is not a closed form expression because the number of additions needed evaluate £(n) grows 
indefinitely with n. A closed form expression that computes the value of £(n) is ^^^y^, which only requiresB = 3 operations. 

EXERCISES FOR SECTION 8.1 
A Exercises 

1. By the recursive definition of binomial coefficients, C(5; 2) = C(4; 2) + C(4; 1). Continue expanding C(5; 2) to express it in terms of 
quantities defined by the basis. Check your result by applying the factorial definition of C (n; k). 

2. Define the sequence L by Zo = 5 and for ^ > 1 , L = 2 L;^. j - 7. Determine L4 and prove by induction that = 7 - 2*"^^ . 

3. Ijstpix) = :^ + 3x'* - I5x^ + X - 10. 

(a) Write p(_x) in telescoping form. 

(b) Use a calculator to compute p (3) using the original form of p(x) . 

(c) Use a calculator to compute p (3) using the telescoping form of p(x) . 

(d) Compare your speed in parts b and c. 
B Exercises 

4. Suppose that a list of nine items, (r(l), r(2) r(9)), is sorted by key in decending order so that r (3). key = 12 and r (4).key = 10. List the 

executions of BinarySearch that would be needed to complete Binary Search( 1,9) for: 

(a) C = 12 

(b) C= 11 

Assume that distinct items have distinct keys. 

5. What is wrong with the following definition of / : R ^ R? 

/(O) = land/(;c) = f (x/2)/2it x i=0. 
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I 8.2 Sequences 

Definition: Sequence. A sequence is a function from the natural numbers into some predetermined set. The image of any natural 

number k can be written interchangeably as S (k) or and is called the fe"* term of S. The variable k is called the index or argument of the 
sequence. 

For example, a sequence of integers would be a function S : -» Z . 
Example 8.2.1. 

(a) The sequence A defined by A (A:) = - k, k > 0, is a sequence of integers. 

(b) The sequence B defined reciu-sively by B(0) = 2 and B (k) = B{k - 1) + 3 for^ > 1 is a sequence of integers. The terms of B can be 
computed either by applying the recursion formula or by iteration. For example; 

B(3) = B(2) + 3 

= (B(l) + 3) + 3 
= ((B(0) + 3) + 3) + 3) 
= ((2 + 3) + 3) + 3 
= 11 

or 

B(l) = B(0) + 3 = 2 + 3 = 5 

B(2) = B(l) + 3 = 5 + 8 = 8 
B (3) = B (2) + 3 = 8 + 3 = 11 . 

(c) Let Cr be the number of strings of O's and I's of length r having no consecutive zeros. These terms define a sequence C of integers. 
Remarks; 

(1) A sequence is often called a discrete function. 

(2) Although it is important to keep in mind that a sequence is a function, another useful way of visualizing a sequence is as a list. For 
example, the sequence A could be written as (0, 0, 2, 6, 12, 20, . . .). Finite sequences can appear much the same way when they are the 
input to or output from a computer. The index of a sequence can be thought of as a time variable. Imagine the terms of a sequence flashing on a 
screen every second. The s^^ would be what you see in the k^ second. It is convenient to use terminology like this in describing sequences. For 
example, the terms that precede the k*' term of A would be A (0), A (1), . . . , A{k - 1). They might be called the earlier terms. 

A FUNDAMENTAL PROBLEM 

Given the definition of any sequence, a fundamental problem that we will concern ourselves with is to devise a method for determining any 
specific term in a minimum amount of time. Generally, time can be equated with the number of operations needed. In counting operations, the 
application of a recursive formula would be considered an operation. 

Example 8.2.2. 

(a) The terms of A in Example 8.2.1 are very easy to compute because of the closed form expression. No matter what term you decide to 
compute, only three operations need to be performed. 

(b) How to compute the terms of B is not so clear. Suppose that you wanted to know B (100). One approach would be to apply the definition 
recursively: 

B(IOO) = B(99) + 3 = (B(98) + 3) + 3 = ... 

The recursion equation for B would be applied 100 times and 100 additions would then follow. To compute B {k) by this method, 2 k operations 
are needed. An iterative computation of B (k) is an improvement: 

B(l) = B(0) + 3 = 2 + 3 = 5 
B(2) = B(l) + 3= 5 + 3 = 8 
etc. 

Only k additions are needed. This still isn't a good situation. As k gets large, we take more and more time to compute B {k). The formula 
B{k) = B(k - 1) + 3 is called a recurrence relation on B. The process of finding a closed form expression for B (k), one that requires no more 
than some fixed number of operations, is called solving the recurrence relation. 

(c) The determination of Q is a standard kind of problem in combinatorics. One solution is by way of a recurrence relation. In fact, many 
problems in combinatorics are most easily solved by first searching for a recurrence relation and then solving it. The following observation will 

suggest the recurrence relation that we need to determine Q : If ^ > 2, then every string of O's and I's with length k and no two consecutive O's 
is either l.S(._i or 01 .Sj._2, where Sj^_^ and .Sj,_2 are strings with no two consecutive O's of length k - 1 and k - 2 respectively. From this 
observation we can see that Q = Cj._2 + Cj,_i for k>2. The terms Co = 1 and C\ = 2 are easy to determine by enumeration. Now, by 
iteration, any Q can be easily determined. For example, C5 = 21 can be computed with five additions. A closed form expression for Q would 
be an improvement. Note that the recurrence relation for Q is identical to the one for the Fibonacci sequence (Example 8.1 .4). Only the basis is 
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different. 

EXERCISES FOR SECTION 8.2 
A Exercises 

1 . Prove by induction that B{k) = 3 k + 2,fc>0,isa closed form expression for the sequence B in Example 8.2.1 . 

2. (a) Consider sequence Q defined by Q{k) = 2k + 9,k > I. Complete the table below and determine a recurrence relation that describes Q. 

k Q(k) Q(k) - Q(k - 1) 

2 

3 

4 

5 

6 

7 

(b) Let A(k) = k^ - k, k > 0. Complete the table below and determine a recurrence relation for A . Notice that 

(A(k) -A(k-l)- (A(k -l)-A{k-2)) = A(k) - 2 Aik - I) + Aik - 2) 

k A(k) A(k) -A(k-l) A(k) - 2 A{k - I) + A(k - 2) 

2 

3 

4 

5 

3. Given k lines (k > 0) on a plane such that no two lines are parallel and no three lines meet at the same point, let P (k) be the number of 
regions into which the lines divide the plane (including the infinite ones (see Figure 8.2.1). Describe geometrically how the recurrence relation 
P(k) = P(k—l) + can be obtained. Given that P (0) = 1, determine P (5). 




FIGURE 8.2.1 Exercises 



4. A sample of a radioactive substance is expected to decay by 0.15 percent each hour. If w,, ; > 0, is the weight of the sample t hours into an 
experiment, write a recurrence relation for w. 

B Exercise 

5. Let M (n) be the number of multiplications needed to evaluate an degree polynomial. Use the recursive definition of a polynomial 
expression to define M recursively. 
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I 8.3 Recurrence Relations 

In this section we will begin our study of recurrence relations and their solutions. Our primary focus will be on the class of finite order linear 
recurrence relations with constant coefficients (shortened to finite order linear relations). First, we will examine closed form expressions from 
which these relations arise. Second, we will present an algorithm for solving them. In later sections we will consider some other common 
relations (8.4) and introduce two additional tools for studying recurrence relations: generating functions (8.5) and matrix methods (Chapter 12). 

Definition: Recurrence Relation. Let S he a sequence of numbers, A recurrence relation on S is a formula that relates all but a finite 
number of terms of S to previous terms of S. That is, there is a kg in the domain of S such that if k > kg, then S(k) is expressed in terms of 
some (and possibly all) of the terms that precede S(k). If the domain of S is [0 , 1,2, ...], the terms 5(0), 5(1), S(kg - 1) are not defined 
by the recurrence formula. Their values are the initial conditions (or boundary conditions, or basis) that complete the definition ofS. 

Example 8.3.1. 

(a) The Fibonacci sequence is defined by the recurrence relation = F^_2 + Fj._i , k>2, with the initial conditions Fq = 1 and Fi = 1 . The 
recurrence relation is called a second-order relation because F^. depends on the two previous terms of F. Recall that the sequence C in Section 
8.2 can be defined with the same recurrence relation, but with different initial conditions. 

(b) The relation r(fe) = 2T(k - if - kT{k - 3) is a third-order recurrence relation. If values of r(0), r(l), and r(2) are specified, then T 
is completely defined. 

(c) The recurrence relation S(n) = 5(Ln/2J) + 5, n > 0, with 5(0) = Ohas infinite order. To determine 5(n) when n is even, you must go 
back n/2 terms. Since n/2 grows imboimded with n, no finite order can be given to S. 



SOLVING RECURRENCE RELATIONS 

Sequences are often most easily defined with a recurrence relation; however, the calculation of terms by directly applying a recurrence relation 
can be time consuming. The process of determining a closed form expression for the terms of a sequence from its recurrence relation is called 
solving the relation. There is no single technique or algorithm that can be used to solve all recurrence relations. In fact, some recurrence 
relations caimot be solved. The relation that defines T above is one such example. Most of the recurrence relations that you are likely to 
encounter in the future as classified as finite order linear recurrence relations with constant coefficients. This class is the one that we wiU spend 
most of our time with in this chapter. 

Definition: n"' Order Linear Recurrence Relation. Let S be a sequence of numbers with domain k >0. An n"' order linear recurrence 
relation on S with constant coefficients is a recurrence relation that can be written in the form 

S{k) + CiS(k - 1) + ...+C„S{k -n) = f{k) fork >n 
where C I, C2, .... C„ are constants and f is a numeric function that is defined for k >n. 

Note: We wiU shorten the name of this class of relations to n* order linear relations. Therefore, in further discussions, S(k) + 2 k S(k — 1) = 
would not be considered a first-order linear relation. 

Example 83.2. 

(a) The Fibonacci sequence is defined by the second-order linear relation because Fj^ - F|j_i - F^_2 = 

(b) The relation P(J) + 2P (J — 3) = is a third-order linear relation. In this case, Ci= €2= 0. 

(c) The relation A(k) = 2(_A(k - 1) -1- ^) can be written as A(k) - 2 A{k - V) = 2k. Therefore, it is a first-order linear relation. 
RECURRENCE RELATIONS OBTAINED FROM "SOLUTIONS" 

Before giving an algorithm for solving finite order linear relations, we will examine recurrence relations that arise from certain closed form 
expressions. The closed form expressions are selected so that we will obtain finite order linear relations from them. This approach may seem a 
bit contrived, but if you were to write down a few simple algebraic expressions, chances are that most of them would be similar to the ones we 
are about to examine. 

Example 833. 

(a) Consider D, defined by D(yt) =5-2* ,k > 0. If > 1, 

D(k) =5-2* = 2-5-2*^-i = 2D{k - 1). 

Therefore, D satisfies the first order linear relation D{k) - 2D{k - \) = Q and the initial condition D (0) = 5 serves as an initial condition 
forD. 

(b) If C{k) = 3*-i + 2*+' +k,k>Q, quite a bit more algebraic manipulation is required to get our result: 



Applied Discrete Structures by Alan Doerr & Keimeth Levasseur is licensed under a Creative Commons Attribution-Nonconmiercial-ShareAlike 3.0 United States License. 



148 



Chapter 8 - Recursion and Recurrence Relations 



C (k) = 3*-i + 2*+i + A: Original equation 

3C(,k-l) =3*-' +3-2* + 3(A:- 1) Substitute it - 1 fori: and multipy by 3 

Subtract tlie second equation from tiie first 

C(k) - 3 C(k - 1) = -2* -2k + 3 3*"^ term is eliminated, this is a first order relation 

2C(fc- l)-6C(A:-2) = -2*-2(2(fc- l) + 3) Substitute A: - 1 for A: in the 3«' equation, mult.by2 

Subtract the fourth equation from the third equation 

Cik) -5C{k-l)-6C(k-2) = 2k-l 2*+' term eliminated, this is a 2"'' order relation 

The recurrence relation that we have just obtained, defined for k > 2, together with the initial conditions C (0) = 7/3andC(l) = 5, define 
C. We could do more algebra to obtain a third-order linear relation in this case. 

Table 8.3.1 summarizes our results together with a few other examples that we will let the reader derive. Based on these results, we might 
conjecture that any closed form expression for a sequence that combines exponential expressions and polynomial expressions will be solutions 
of finite order linear relations. Not only is this true, but the converse is true: a finite order linear relation defines a closed form expression that is 
similar to the ones that were just examined. The only additional information that is needed is a set of initial conditions. 

Closed Form Expression Recurrence Relation 

D(yfe) = 5-2* D{k)-2D{k-V) = Q 

C(A:) = 3*-' +2*+i +k C{k)-2C(k- I) - 6C {k - 2) = 2k-l 

Q(k) = 2k + 9 Q{k)-Q(k-\) = 2 

A{k) = k^-k A{k)-2A{k-\)+A{k-2) = 2 

B(k) = 2k^ + 1 B{k)-2B(k-\) + B{k-2) = A 

G{k) = 2 -4* - 5 (-3)* G{k) - G{k - 1) + 12 G(k - 2) = 

J{k) = (3 + /t) 2* J(k) -AJ(k-l)+AJ(k-2)=0 
Table 8.3.1 

Recurrence Relation Obtained from Certain Sequences 

Definition: Homogeneous Recurrence Relation. An n'^ order linear relation is homogeneous if f (k) = for all k. For each recur- 
rence relation S(k) + CjS(k - 1) + ...+ C„S(k — n) = f(k), the associated homogeneous relation is 
S(k) + C,S{k - 1) + ...+ CnS{k-n) =0 

Example 8.3.4. D{k) - 2D(k - 1) = is a first-order homogeneous relation. Since it can also be written e& D(k) = 2D(k— 1), it 
should be no surprise that it arose from an expression that involves powers of 2 (see Example 8.3.3a). More generally, you would expect that 

the solution of L{k) - aL{k - 1) would involve a* . Actually, the solution is L{k) = L{0)a'^ , where the value of L(0) is given by the the 
initial condition. 

Example 8.3.5. Consider the second-order homogeneous relation S(k) — lS{k - 1) + \2S(k- 2) = together with the initial 
conditions 5(0) =4 and S(l) = 4. From our discussion above, we can predict that the solution to this relation involves terms of the form 
b a*, where b and a are nonzero constants that must be determined. If the solution were to equal this quantity exacfly , then 

S{k) = ba'' 
S{k - \) = b a*"'^ 
S(yt-2) = fca*-2 
Substitute these expressons into the recurrence relation to get 

ia*-7ia*-i -I- 12ia*-' =0 (Eq8.3a) 
Each term on the left-hand side of the equation has a factor of b a*"^, which is nonzero. Dividing through by this common factor yields 
a^ - la + 12 = {a - 3)(a - 4) = 0. (Eq8.3b) 

Therefore, the only possible values of a are 3 and 4. Equation (8.3b) is called the characteristic equation of the recurrence relation. The fact is 

that our original recurrence relation is true for any sequence of the form S{k) = bii'^ + b2 4*, where bi and b2 are real numbers. This set of 
sequences is called the general solution of the recurrence relation. If we didn't have initial conditions for S, we would stop here. The initial 
conditions make it possible for us to obtain definite values for bi and 



|S(0)=4| |^>i3<' + fo24<' = 4| I bi+b2 = 4 ^ 
U(l)=4/ ^Ui3>+fo24i=4/ ^\3bi+4b2=4l 



The solution of this set of simultaneous equations is 6 1 = 12 and 62 = -8 and so the solution is S(k) = 12 3* - 8 4*. 

Definition: Characteristic Equation. The characteristic equation of the homogeneous n* order linear relation 
S(k) + Ci S(k - 1) +... + €„ S(k — n) =0 is the nth degree polynomial equation 
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a" + i Cjo"-} = a" + Ci a"-^ + ■■■ + C„_i x + C„ = 

.7=1 

The left-hand side of this equation is called the characteristic polynomial. 
Example 83.6. 

(a) The characteristic equation of F{k) - F {k - 1) - F {k - 2) = is - a - 1=0. 

(b) The characteristic equation of QW + IQih - 1) - 3G(*: - 2) - 6 G(*: - 4) = is a"* + 2a' - 3a^ - 6 = 0. Note that the 
absence of a Q - 3) term means that there is not an x'"'^ = x term appearing in the characteristic equation. 

Algorithm 83.1 : Algorithm for Solving Homogeneous n* Order Linear Relations. 

(a) Write out the characteristic equation of the relation S{k) +CiS(k - 1) + ... + C„S(k — n) =0, which is 
(f + Cia"-' + ■■■ +C„_jx +C„ =0. 

(b) Find all roots of the characteristic equation, called characteristic roots. 

(c) If there are n distinct characteristic roots, aj, 02, a„, then the general solution of the recurrence relation is 
S (k) = hj aj'' + h2 a2 + ■■■ +h„ fl„*. If there are fewer than n characteristic roots, then at least one root is a multiple root. If aj is a double 

root, then the bjaj' term is replaced with (bjQ +bj] k^a/. In general, if aj is a root of multiplicity p, then the bja/ term is replaced with 

[bjo+bjik + -+bj^^_,)kP-')ajK 

(d) Ifn initial conditions are given, obtain n linear equations in n unknowns (the bj' sjrom Step (c)) by substitution. If possible, solve 
these equations to determine a final form for S{k). 

Although this algorithm is valid for all values of n, there are limits to the size of n for which the algorithm is feasible. Using just a pencil and 
paper, we can always solve second-order equations. The quadratic formula for the roots of a + bx + c = Ois 

-h±-\j h^-Aac 



The solutions of + Ci a + C2 = are then 

i(-Ci + VCi2-4C2 ) and i (-Ci - V^i^ - 4 C2 ) 

Although cubic and quartic formulas exist, they are too lengthy to introduce here. For this reason, the only higher-order relations (n > 3) that 
you could be expected to solve by hand are ones for which there is an easy factorization of the characteristic polynomial. 

Example 83.7. Suppose that T is defined by T{k) =1 T{k - \) - \QT{k - 2), with , r(0) = 4 and r(l) = 17. We can solve this 
recurrence relation with Algorithm 8.3.1: 

(a) Note that we had written the recurrence relation in "nonstandard" form. To avoid errors in this easy step, you might consider a rearrange- 
ment of the equation to, in this case, T(k) - 7 T(k - 1) -1- 10 T(k - 2) = 0. Therefore, the characteristic equation is - 7 a + 10 = 0. 

(b) The characteristic roots are^ ^7 + V 49 - 40 j = 5 and ^ - V 49 - 40 j = 2. These roots can be just as easily obtained by factoring the 
characteristic polynomial into (a - 5)(a - 2). 

(c) The general solution of the recurrence relation is T(k) = bil'^ + bj 5* , 

|7"(0) = 4| |foi20+i.25'' = 4| I fci+fo2 = 4 ^ 
^' lr(l) = 17/ ^Ui2i+i25'=4/ ^\lbi+5bi = \ll 

The simulations equations have the solution b\ = \ and b2 = 3, Therefore, T{k) = 2* -1- 3-5*. 

Here is one rule that might come in handy: If the coefficients of the characteristic polynomial are all integers, with the constant term equal to 
m, then the only possible rational characteristic roots are divisors of m (both positive and negative). 

With the aid of a computer (or possibly only a calculator), we can increase n. Approximations of the characteristic roots can be obtained by 
any of several well-known methods, some of which are part of standard software packages. There is no general rule that specifies the values 
of n for which numerical approximations wiU be feasible. The accuracy that you get wiU depend on the relation that you try to solve. (See 
Exercise 17 of this section.) 

Example 83.8. Solves (A:) - 1 S{k - 2) + 6S{k - 3) = 0, where 5(0) =8,5(1) = 6, and S (2) = 22. 

(a) The characteristic equation is a' - 7a + 6 = 0. 

(b) The only rational roots that we can attempt are ± 1, ±2, ±3, and ± 6. By checking these, we obtain the three roots 1,2, and —3. 

(c) The general solution is S{k) = 1* + ^2 2* + i3(-3)*. The first term can simply be written bi . 
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S(0) = 8 bi+b2 + b3 = S 

(d) ( S(l) = 6 I =>{ bi+2b2-3b3 = 6 I 
5(20 = 22 ^1+4^2 + 9*3 = 22 

You can solve this system by elimination to obtain bi = 5, ^2 = 2, and 63 = 1 . Therefore, 

S(k) = 5 + 2-2* + (-3)* = 5 + 2*+' + (-3)* 
Example 83.9. Solve D (it) - SD(k - I) + 16D(k - 2) = 0, where D (2) = 16 and D (3) = 80. 

(a) Characteristic equation; - 8a + 16 = 0. 

(b) a^- 8a + 16 = (a - 4)^. Therefore, there is a double characteristic root, 4. 

(c) General solution: D(k) = (b^o + bn k) 4*. 

|D(2) = 16| |(fo,o + fcii2)42 = 16| I 16fo,o + 32fo„ = 16 I 2 | 

^ ' \O(3) = 80/ "^1(^^10 + fell 3)43 = 80/ ^l64^)io + 192fc„=80/ ^ \bn = ^' 

Therefore D (A:) = (1/2 + (1/4) A:) 4*= (2 + A:) 4*-'. 

SOLUTION OF NONHOMOGENEOUS FINITE ORDER LINEAR RELATIONS 

Our algorithm for nonhomogeneous relations will not be as complete as for the homogeneous case. This is due to the fact that different right- 
hand sides (f(k)'s) call for different procedures in obtaining a particular solution in Steps (b) and (c). 

Algorithm 83 J: Algorithm for Solving Nonhomogeneous Finite Order Linear Relations. 
To solve the recurrence relation S{k) + Cj S(k - 1) +...+ €„ S(k — n) = f(k): 

(a) Write the associated homogeneous relation and find its general solution (Steps (a) through (c) of Algorithm 8.3.1). Call this the homoge- 
neous solution, ^''\k). 

(b) Start to obtain what is called a particular solution, ^>'\k) of the recurrence relation by taking an educated guess at the form of a 
particular solu tion. For a large class of right-hand sides, this is not really a guess, since the particular solution is often the same type of 
function as f (k) (see Table 8.3.2). 

Right Hand Side, f(k) Form of & particular Solution, 5*'''(fe) 

constant, q constant, d 

linear function + Qj k linear function dg + djk 

m* degree polynomial, m* degree polynomial, 

qg -hqjk -h ■■■ -l-q„k" dg -l-djk -h ■■■ -hdmlf 

exponential function q ct exponential function d a* 

Table 832 

Particular Solutions for Given Right-hand Sides 

(c) Substitute your guess from Step (b) into the recurrence relation. If you made a good guess, you should be able to determine the unknown 
coefficients of your guess. If you made a wrong guess, it should be apparent from the result of this substitution, so go back to Step (b). 

(d) The general solution of the recurrence relation is the sum of the homogeneous and particular solutions. If no conditions are given, then 
you are finished. Ifn initial conditions are given, they will translate to n linear equations in n unknowns and solve the system, if possible, to 
get a complete solution. 

Example 83.10. Solve 5(fe) + 5 5(fe - 1) = 9, withS(0) = 6. 

(a) The associated homogeneous relation, 5(fe) + 5 5(fe — 1) = has the characteristic equation a + 5 = 0; therefore, a = -5. The 

homogeneous solution is 5**'(^) = b (-5)*. 

(b) Since the right-hand side is a constant, we guess that the particular solution will be a constant, d. 

(c) If we substitute 5*'''(^) = d into the recurrence relation, we get + 5d = 9,or6af = 9. Therefore, 5*''' (^) = 1.5 

(d) The general solution of the recurrence relation is 

S(k) = S^''\k) + S^P\k) = b (-5)* + 1.5 
The initial condition wiU give us one equation to solve in order to determine b. 

S(0) = 6 => ^(-5)" + 1.5 = 6 -I- 1.5 = 6 

Therefore,/) = 4.5 mid S(k) = 4.5 (-5)* + 1.5. 
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Example 83.11. Consider r (it) - 7 T (k - I) + lOT (k - 2) = 6 + 8)twithr(0) = landr(l) = 2. 

(a) From Example 8.3.7, we know that T^''\k) = foi 2*^ + b2 5*^. Caution: Don't apply the initial conditions to rC"' until you add r""'! 

(b) Since the right-hand side is a linear polynomial, T^^^ is linear; that is, T^Xk) =do+di k. 

(c) Substitution into the recurrence relation yields: 

(do + dik)-l (do + d\{k- 1)) + 10 {do + di{k - 2)) = 6 + S k 

(4do - 13di) + {4di)k = 6 + Sk 

Two polynomials are equal only if their coefficients are equal. Therefore, 

j 4do - I3di=6 \ j do = S \ 
1 4di=8 l^\di=2l 

(d) Use the general solution T(k) = foi 2* + 62 5* + 8 + 2 A: and the initial conditions to get a final solution: 

I r(0) = 1 > I + + 8 = 1 I 
1 r(l) = 2 / I 2 foi + 5 62 + 10 = 2 / 



Therefore, Tik) = -9-2* + 2-5* + 8 + 2/t 

A quick note on interest rates: When a quantity, such as a savings account balance, is increased by some fixed percent, it is most 
easily computed with a multipier. In the case of an 8% increase, the multier is 1.08 because any original amount A, has 0.08 A added to it, so 
that the new balance is 

A + 0.08A = (1 +0.08)A = 1.08A. 

Another example is that if the interest rate is 3.5%, the multiplier would be 1 .035. This presumes that the interest is applied a the end of year 
for 3.5% annual interest, often called simple interest. If the interest is applied monthly, and we assume a simplifed case where each month 

has the same length, the multiplier after every month would be |l + » 1.0292. After a year passes, this multiplier would be applied 12 
times, which is the same as multiplying by 1 .0292'^ » 1 .3556. That increase from 1 .035 to 1 .3556 is the effect of compound interest. 

Example 83.12. Suppose you open a savings account that pays an annual interest rate of 8%. In addition, suppose you decide to deposit 
one dollar when you open the account, and you intend to double your deposit each year. Let B (k) be your balance after k years. B can be 

described by the relation B(k) = 1 .08 B - 1) + 2*, with S (0) = 1 . If, instead of doubling the deposit each year, you deposited a constant 

amount, q, the 2* term would be replaced with q, A sequence of regular deposits such as this is called an annuity. 

Retuming to the original situation, we can obtain a closed form expression for B*'"': 

(a) B<*'(/(:) = ^)i(1.08)* 

(b) B^P^k) should be of the form d 2* . 

(c) d2* = 1.08rf2*-i +2* 

^ (2rf)2*-' = 1.08d2*-i+ 2-2*-i 
=^ 2d = 1.08rf + 2 
=> .92d = 2 

d= 2.114 (to the nearest thousandth) 

Therefore B'^'CA:) = 2.174- 2* 

(d) B(0) = 1 ^ bi+ 2.174 = 1 

=> fei = -1.174 

B{k) = -1.174- 1.08*+ 2.174-2*. 

Example 83.13. Find the general solution to S (it) - 3S{k - V) - 4S(k - 2) = 4''. 

(a) The characteristic roots of the associated homogeneous relation are -1 and 4. Therefore, = bi(-Vf + b2 4*. 

(b) A function of the form d4* will not be a particular solution of the nonhomogeneous relation since it solves the associated homogeneous 
relation. When the right-hand side involves an exponential function with a base that equals a characteristic root, you should multiply your 

guess at a particular solution by k. Our guess at S'^\k) would then be <i A: 4* . See below for a more complete description of this procedure. 
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(c) Substitute d A: 4* into the recurrence relation for S (k): 

d 4* - 3 d (yfe - 1) 4*-i - 4 (it - 2) 4*-^ = 4* 

16 dk 4*-2 - 12 d (it - 1) 4*-2 -4d(k-2) 4*-^ = 4* 
Each term on the left-hand side has a factor of 4*"^ 

16dk - \2d(k - 1) - Ad{k - 2) = 4^ 
20d =16 ^ d = 0.8 

Therefore, 5<'''(it) = 0.8 it 4'' 

(d) The general solution to the recurrence relation is 

S{k) =bi(-lf+ ^)2 4* +0.8fe4* 
BASE OF RIGHT-HAND SIDE EQUAL TO CHARACTERISTIC ROOT 

If the right-hand side of a nonhoraogeneous relation involves an exponential with base a, and a is also a characteristic root of multipUcity p, 
then multiply your guess at a particular solution as prescribed in Table 8,3-2 by , where k is the index of the sequence. 

Example 83.14. 

(a) IfS(fe) - 9S(yt - 1) -I- 20S(yt - 2) = 2 -5*, the characteristic roots are 4 and 5. Sf'-'W will take the form d it 5*. 

(b) If Sin) - (>S{n - 1) -I- 9S{n - 2) = 3"*^ the only characteristic root is 3, but it is a double root (multiplicity 2). Therefore, the form 
of the particular solution is drP-3". 

(c) If (2(i) - <2(y - 1) - 12 <2(y - 2) = (-3y -I- 6-4^, the characteristic roots are -3 and 4. The form of the particular solution will be 
dij(-3y+d2j-4J. 

(d) U Sik) - 9S{k - I) + SS{k - 2) = 9k + 1 = (9 k + 1)1* , the characteristic roots are 1 and 8. If the right-hand side is a polyno- 
mial, as it is in this case, then the exponential factor 1* can be introduced. The particular solution will take the form kido + di k). 

We conclude this section with a comment on the situation in which the characteristic equation gives rise to complex roots. If we restrict the 

coefficients of our finite order linear relations to real numbers , or even to integers , we can still encounter characteristic equations whose roots 
are complex. Here, we will simply take the time to point out that our algorithms are still valid with complex characteristic roots, but the 
customary method for expressing the solutions of these relations is different. Since an understanding of these representations requires some 
background in complex numbers, we will simply suggest that an interested reader can refer to a more advanced treatment of recurrence 
relations (see also difference equations). 

EXERCISES FOR SECTION 8.3 
A Exercises 

Solve the following sets of recurrence relations and initial conditions: 



1. 


Sik) 


- lOSik - I) + 9Sik - 2) 


= 0, 5(0) = 3, 5(1) = 11 


2. 


S(.k) 


- 9 Sik - I) + nSik - 2) 


= 5(0) = 0, 5(1) = 3 


3. 


S(k) 


- 0.25 Sik - 1) = , 5(0) 


= 6 


4. 


Sik) 


- 20Sik - I) + liKlSik - 


2) = 0, 5(0) = 2, 5(1) = 50 


5. 


Sik) 


- 2Sik - I) + Sik - 2) = 


2 5(0) = 25, 5(1) = 16 


6. 


Sik) 


- Sik - I) - 6Sik - 2) = 


-30 5(0) = 7, 5(1) = 10 


7. 


Sik) 


- 5Sik - 1) = 5*, 5(0) = 


3 


8. 


Sik) 


- 55(fc - 1) -1- 65(fc - 2) 


= 2, 5(0) = -1, 5(1) = 


9. 


Sik) 


- 45(yt - 1) -1- 45(yt - 2) 


= 3k + 2K 5(0) = 1, 5(1) = 1 



10. 5(*:) = rSik - I) + a , 5(0) = 0, r, a > 0, ri= I 

11. 5(*:) - 45(*: - 1) - 11 5(*:- 2)+ 30Sik - 3) = 0, 
5(0) = 0, 5(1) = -35, 5(2) = -85 

12. Find a closed form expression for P ik) in Exercise 3 of Section 8.2. 

13. (a) Find a closed form expression for the terms of the Fibonacci sequence (see Example 8.1 .4). 
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(b) The sequence C was defined by = the number of strings of zeros and ones with length r having no consecutive zeros (Example 
8.2.1(c)). Its recurrence, relation is the same as that of the Fibonacci sequence. Determine a closed form expression for C,-, r > 1 , 

n 

14. If S(n) = 2 g(j), n> 1, then S can be described with the recurrence relation 5 (n) = 5 (n - 1) + g (n). For each of the following sequences 
that are defined using a summation, find a closed form expression: 

(a) Sin) = Zj, n^l 

J=i 

(b) Q(n) = E/, n>l 

(c) P(n) = zCt)', n^O 

id) T(n)= Zj\ n>l 
J=i 

B Exercises 

15. Let £) (n) be the number of ways that the set 1 1 , 2, n},n > 1, can be partitioned into two nonempty subsets. 

(a) Find a recurrence relation for D. (Hint: It will be a first-order linear relation.) 

(b) Solve the recurrence relation. 

16. If you were to deposit a certain amount of money at the end of each year for a number of years, this sequence of payment would be called 

an annuity (see Example 8.3.12,). 

(a) Find a closed form expression for the balance or value of an annuity that consists of payments of q dollars at a rate of interest of i. Note 
that for a normal annuity, the first payment is made after one year. 

(b) With an interest rate of 12.5%, how much would you need to deposit into an aimuity to have a value of one million dollars after 18 years? 

(c) The payment of a loan is a form of annuity in which the initial value is some negative amount (the amount of the loan) and the aimuity 
ends when the value is raised to zero. How much could you borrow if you can afford to pay $5,000 per year for 25 years at 14% interest? 

C Exercises 

17. Suppose that C is a small positive number. Consider the recurrence relation B(yt) - 2B(k - 1) + (l - C^)B(k-2) = with initial 

conditions B (0) = 1 and B(l) = 1 . If C is small enough, we might consider approximating the relation by replacing 1 - with 1 and with 
0. Solve the original relation and its approximation. Let Ba a be the solution of the approximation. Compare closed form expressions for B (k) 
and Ba(k). Their forms are very different because the characteristic roots of the original relation were close together and the approximation 
resulted in one double characteristic root. If characteristic roots of a relation are relatively far apart, this problem will not occur. For example, 
compare the general solutions of 

Sik) + 1.001 S(/t - 1) - 2.004002 5(/t - 2) = 0.0001 and 
Saik) + Saik - 1) - 2Sa(k - 2) = 0. 
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I 8.4 Some Common Recurrence Relations 

In this section we intend to examine a variety of recurrence relations that are not finite-order linear with constant coefficients. For each part of 
this section, we will consider a concrete example, present a solution, and, if possible, examine a more general form of the original relation. 

Example SA.l. Consider the homogeneous first-order linear relation S(_n) - nS(n - 1) = 0, n > 1, with initial condition 5(0) = 1. 
Upon close examination of this relation, we see that the nth term is n times the (n - 1)*' term, which is a property of n factorial. S (n) = n ! is 
a solution of this relation, for if n > 1 , 

S{n) = n! = n -(n - 1)! = n-S{n - 1). 

In addition, since ! = 1, the initial condition is satisfied. It should be pointed out that from a computational point of view, our "solution" 
really isn't much of an improvement since the exact calculation of n ! takes n - 1 multiplications. 

If we examine a similar relation, Gik) - l'^ G{k - 1),^>1 with G (0) = 1 , a table of values for G suggests a possible solution: 

A: 1 2 3 4 5 
G{k) 1 2 23 2* 2'° 215 

The exponent of 2 in G{k) is growing according to the relation E{k) = E{k - \) + k, with E(0) = 0. Thus E(k)= — - — and 

G (k) = 2*<*+i'/2 . Note that G (k) could also be written as 2° 2i 2^ • • -2*, for A: > 0, but this is not a closed form expression. 

In general, the relation P(n) = f (n) P (n - 1) forn > 1 with P (0) = / (0), where/is a function that is defined for aU n > 0, has the "solution" 

p(n)= nm 

k=0 

This product form of P (n) is not a closed form expression because as n grows, the number of multiplications grow. Thus, it is really not a true 
solution. Often, as for G (k) above, a closed form expression can be obtained from the product form. 

Example 8.4.2. Analysis of a Binary Search Algorithm. Suppose you intend to use a binary search algorithm (see Example 8.1 .3) on 
lists of zero or more sorted items, and that the items are stored in an array, so that you have easy access to each item. A natural question to 
ask is "How much time will it take to complete the search?" When a question like this is asked, the time we refer to is often the so - called 
worst - case time. That is, if we were to search through n items, what is the longest amount of time that we will need to complete the search? 
In order to make an analysis such as this independent of the computer to be used, time is measured by counting the number of steps that are 
executed. Each step (or sequence of steps) is assigned an absolute time, or weight; therefore, our answer will not be in seconds, but in 
absolute time units. If the steps in two different algorithms are assigned weights that are consistent, then analyses of the algorithms can be 
used to compare their relative efficiencies. There are two major steps that must be executed in a call of the binary search algorithm: 

(1) If the lower index is less than or equal to the upper index, then the middle of the list is located and its key is compared to the value that 
you are searching for. 

(2) In the worst case, the algorithm must be executed with a list that is roughly half as large as in the previous execution. If we assume that 
Step I takes one time unit and T (n) is the worst - case time for a list of n items, then 

Tin)= 1 + 7'(Ln/2J) for n > (8.4 a) 

For simplicity, we will assume that 

r(0) = (8.4 b) 

even though the conditions of Step 1 must be evaluated as false if n = 0. You might wonder why n/2 is truncated in 8.4a. If n is odd, then 

n = 2k + 1 for some k> 0, the middle of the list will be the (k + 1)*" item, and no matter what half of the list the search is directed to, the 
reduced list will have k = In/lj items. On the other hand, if n is even, then n = 2 ^ for ^ > . The middle of the list will be the k'^ item, and 
the worst case will occur if we are directed to the k items that come after the middle (the (k + If^ through (2 A:)* items). Again the reduced 
list has Li/2J items. 

Solution to 8.4 a and 8.4 b. To determine T in), the easiest case is when n is a power of two. If we compute T (2™), m > , by iteration, our 
results are 



T(\) = 


1 


+ 


T(0) 


= 1 


T(2) = 


1 


+ 


r(i) 


= 2 


T(4) = 


1 


+ 


7" (2) 


= 3 


r(8) = 


1 


+ 


T{4) 


= 4. 



The pattern that is established makes it clear that 7(2") = m + 1 . This result would seem to indicate that every time you double the size of 
your list, the search time increases by only one unit. 

A more complete solution can be obtained if we represent n in binary form. For each n > 1, there exists a non - negative integer r such that 

2'-i<n<2'' (8.4 c) 

For example, if n = 21, 2'' < 21 < 2'; therefore, r = 5. If n satisfies (8.4c), its binary representation requires r digits. For example, 21ten = 

lOlOltwo. 
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In general, n = (ai 02 ... ar)two- where ai = 1. Note that in this form, [n/l} is easy to describe: it is the r - 1 digit binary number 

(ai 02 ... ar-i)two 

Therefore, 

r(n) = T(ai 02 ...Or) 

= 1 + T{ai 02 ... Cr-i) 

= !+(!+ r(aia2...a,_2)) 

= 2+ T{ai 02... ar-2) 

= (r - 1) + T{ai) 

= (r - 1) + 1 since T(l) = 1 

= r 

From the pattern that we've just established, T (n) reduces to r. A formal inductive proof of this statement is possible. However, we expect 
that most readers would be satisfied with the argument above. Any skeptics are invited to provide the inductive proof. If n = 21 : 

T(2l) = r(lOlOl) 

= 1 + r(1010) 

= 1 + (1 + r(ioi)) 

= 1 + (1 + (1 + 7"(10))) 

= 1 + (1 + (1 + (1 + Tum 
= 1 + (1 + (1 + (1 + (1 + r(0))))) 

= 5 

Conclusion; The solution to 8.4 a and 8.4b is that for n > 1 , T{n) = r, where 2"^"' <n< 2''. 

A less cumbersome statement of this fact is that r(n) = Llog2 nj + 1. For example, r(21) = |.log2 21J + 1= 4 + 1= 5 . 
REVIEW OF LOGARITHMS 

Any discussion of logarithms must start by establishing a base, which can be any positive number other than 1 . With the exception of Theorem 
8.4.1, our base will be 2. We will see that the use of a different base (10 and e » 2.171828 are the other common ones) only has the effect of 
multiplying each logarithm by a constant. Therefore, the base that you use really isn't very important. Our choice of base 2 logarithms is 
convenient for the problems that we are considering. 

The base 2 logarithm of a positive number represents an exponent and is defined by the following equivalence for any positive real number a. 

log2 a = X «=> 2 * = a . 

For example, logj 8 = 3 because 2' = 8 and logj 1 .414 « 0.5 because 2"-^ * 1 .414 . A graph of the function / (x) = logj x in Figure 8.4.1a 
shows that if a < fc, the logjc < log2fc; that is, when x increases, iog2X also increases. However, if we move x from 2'" = 1024 to 
2" = 2048, log2X only increases from 10 to 11. This slow rate of increase of the logarithm function is an important point to remember. An 
algorithm acting on n pieces of data that can be executed in logj n time units can handle significantly larger sets of data than an algorithm that 

can be executed in n/ 100 or even -^An" time units (see Figure 8.4.1b). The graph of T(n) = Llog2 nj + 1 would show the same behavior. 
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1000 2000 



Figure 8.4.1 Log graphs 

A few more properties that we will use in subsequent discussions involving logarithms are summarized in the following theorem. 
Theorem 8.4.1. Fundamental Properties of Logarithms. Let a and b be positive real numbers, and r a real number. 



logz 1 = 




(8.4 d) 


logj ab = logj a 


+ log2 b 


(8.4 e) 




logjfe 


(8.4 f) 


logj a' = r log2 a 




(8.4 g) 


2'°h" = a 




(8.4 h) 



Returning to the binary search algorithm, we can derive the final expression for T{n) using the properties of logarithms, including that the 
logarithm function is increasing so that inequalities are maintained when taking logarithms of numbers. 

Tin) = r « l""-! <n<2'' 

log2 2''"' < logj n < log2 2^ 
<t=> r - 1 < logj n<r 
<=> r - 1 = Llog2 "J 
« T(n) = r = Llog2 nj + 1 

We can apply several of these properties of logarithms to get an alternate expression for T{n): 

Llog2 nj + l = Llog2 « + IJ 

= Llog2n + log2 2J 
= Llog2 2nJ 

Definition: Logarithms base b, If b > 0, b ^ 1, then for a >0, 
logi^ a = X <=> If = a 
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Theorem 8.4 J. Let b > 0, b ^ I. Then for all a > 0, log^^a = - — -. Therefore, if b > 1 , base b logarithms can be obtained from base 2 
logarithms by dividing by the positive scaling factor log2 b. Ifb < 1, this scaling factor is negative. 

Proof: By an analogue of 8.4f, a = b^^t". Therefore, if we take the base 2 logarithm of both sides of this equality we obtain: 
log^a = \og,{h"'t2'-) 
=^ log2 a = logj a logj b 
To obtain the desired result, divide both sides of this last equation by logj b. ■ 
Note:log2l0 « 3.32192 and logje = 1.55269. 

If the time that was assigned to Step 1 of the binary search algorithm is changed, we wouldn't expect the form of the solution to be very 

different. If 7"(n) = a + T (Ln/2 J) with 7" (0) = c,then7"(n) = c + aLlog2 2nJ. 

A further generalization would be to add a coefficient to T (ln/2 J): T(n) = a + bT (ln/2 J) with T (0) = c, where a, b, c e R, and hi^Ois 
not quite as simple to derive. First, if we consider values of n that are powers of 2: 

r(l) = a + hTiO) = a + he 

Til) =a + b(a + bc) = a + ab +cb^ 

r(4) =a + b{a + ab + cb'^) = a + ab + ab^ + cb^ 

7(20 = a + ab + ab'^ + ••• + ab' + cb'*^ 
If n is not a power of 2, by reasoning that is identical to what we used to solve 8.4a and 8.4b, 

Tin) = Y.ah'' + cb'*^ 

k=0 

where r = [log, n\ . 

The first term of this expression can be written in closed form. Let x be that sum: 

X = a + ab + ab^ + ■■■ + a¥ 
bx= ab + ab^ + ■■■ + ay + ay*^ 
We've multipUed each term of by and aUgned the identical terms in x and bx. Now if we subtract the two equations, 

x-bx = a- ab''^^ 
xil -b) = ail - 
Therefore, 

X = a 

b-l 

A closed form expression for T in) is 

Tin) = a ~~~ + cV^^ where r = [logj nj 

Example 8.4.3. The efficiency of any search algorithm such as the binary search relies on fact that the search list is sorted according to a key 
value and that the search is based on the key value. There are several methods for sorting a list. One example is the bubble sort. You might be 
familiar with this one since it is a popular "first sorting algorithm." A time analysis of the algorithm shows that if B in) is the worst-case time 
needed to complete the bubble sort on n items, then Bin) = (n - 1) + Bin - 1) and B(l) = 0. The solution of this relation is a quadratic 

function Bin) = ^ (n^ - n). The growth rate of a quadratic function such as this one is controlled by its squared term. Any other terms are 

dwarfed by it as n gets large. For the bubble sort, this means that if we double the size of the Ust that we are to sort, n changes to 2 n and so n^ 
becomes 4n^ . Therefore, the time needed to do a bubble sort is quadrupled. One alternative to bubble sort is the merge sort. Here is a simple 
version of this algorithm for sorting F = (r(l), r(2), rin)],n > l.lfn = 1, the Ust is sorted trivially. If n > 2then: 

(1) Divide F into Fi = (r(l), r(Ln/2J)) and Fj = {r(Ln/2J + 1), r(n)). 

(2) Sort Fi and F2 using a merge sort. 

(3) Merge the sorted lists Fi and F? into one sorted list. If the sort is to be done in descending order of key values, you continue to choose the 
higher key value from the fronts of Fi and F2 and place them in the back of F. 

Note that Fi will always have L«/2J items and F2 wiU have ln/2'\ items; thus, if n is odd, F2 gets one more item than Fi. We will assume that 
the time required to perform Step 1 of the algorithm is insignificant compared to the other steps; therefore, we wiU assign a time value of zero 
to this step. Step 3 requires roughly n comparisons and n movements of items from Fi and F2 to F; thus, its time is proportional to n. For this 
reason, we will assume that Step 3 takes n time units. Since Step 2 requires r([n/2J) + r(|"n/2]) time units. 
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Tin) = n + r(Ln/2J) + nfnm) (8.4i) 
with the initial condition 

r(l) = 0. (8.4j) 
Instead of an exact solution of 8.41 and 8.4j, we will be content with an estimate for T(n). First, consider the case ofn = 2'',r > 1: 

r(2') = T(2) = 2 + T(l) + T(l) =2 = 1-2 
T(2?) = r(4) = 4 + T(Tj + T(2) = 8 = 2-4 
T(2^) = r(8) = 8 + r(4) + r(4) = 24 = 3 - 8 

r(20 = r2''= 2''log2 2'' 

Thus, if n is a power of 2, T(n) = n log2 n. Now if, for some r > 2, 2''"' < n < 2'' , then (r - 1) 2''"^ < r(n) < r T. This can be proven by 
induction on As n increases from 2^~^ Xolf ,T (n) increases from (r - 1) 2''~'to r 2^ and is slightly larger than \n logj n\ . The discrepancy is 
small enough so that Tdn) = In logj nj can be considered a solution of 8.4i and 8.4j for the purposes of comparing the merge sort with other 
algorithms. Table 8.4.1 compares B(n) with Te(_n) for selected values of n. 

Table 8.4.1 

Comparison of Times for Bubble Sort and IWerge Sort 



n 


Bin) 


Tein) 


10 


45 


34 


50 


1225 


283 


100 


4950 


665 


500 


124750 


4483 


1000 


499500 


9966 



Definition: Derangement. A derangement of a set A is a permutation of A (i.e., a bijection from A into A) such that f{a) i=a for all 
a £ A. 

Example 8.4.4. If A = 1 1 , 2, . . . , «), an interesting question might be "How many derangements are there of A?" We know that our answer is 
bounded above by n !. We can also expect our answer to be quite a bit smaller than n ! since n is the image of itself for (n - 1) ! of the permuta- 
tions of A. 

Let D (n) be the number of derangements of |1 , 2, . . . , n). Our answer will come from discovering a recurrence relation on D. Suppose that 
n > 3. If we are to construct a derangement of |1, 2, ... , n],f, then f (ti) = k ^n. Thus, the image of n can be selected in n - 1 different 
ways. No matter which of the n - I choices we make, we can complete the definition of /in one of two ways. First, we can decide to make 
f (k) = n, leaving D(n — 2) ways of completing the definition of/, since /will be a derangement of {1, 2, n) - [n, k). Second, if we 
decide to select f(k)+ n, each of the D(n - 1) derangements of {1, 2, ...,n - 1} can be used to define /. If g is a derangement of 
{1, 2, n - 1} such that g (p) = fc, then define f by 

n if j = p 
fU) = { k ifj = n 

g(j) otherwise 

Note that with our second construction of/, / (/ («)) = f(k) + n, while in the first construction, / (/ (n)) = f (k) = n. Therefore, no derange- 
ment of {1, 2, n}with/(n) = can be constructed by both methods. 

To recap our result, we see that /is determined by first choosing one of n - 1 images of n and then constructing the remainder of /in one of 
D{n - 2) + D{n - I) ways. Therefore, 

D{n) = {n - V){D{n - 2) + D{n - 1)). (8.4k) 

This homogeneous second-order linear relation with variable coefficients, together with the initial conditions D(\) = and D(2) = 1, 
completely defines D. Instead of deriving a solution of this relation by analytical methods, we will give an empirical derivation of an approxima- 
tion of D (n). Since the derangements of {1, 2 ... , n} are drawn from a pool of n ! permutations, we wiU see what percentage of these permuta- 
tions are derangements by listing the values of n !, D (n), and The results we obtain (see Table 8.4.2) indicate that as n grows, hardly 

changes at all. If this quotient is computed to eight decimal places, for n > 12, D in)/n ! = 0.36787944. The reciprocal of this number, which 
D{n)ln\ seems to be tending toward, is, to eight places, 2.71828182. This number appears in so many places in mathematics that it has its own 

name, e. An approximate solution of our recurrence relation on D is then Din) « — 
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n 


D(n) 




1 


U 


u 


z 


1 
1 


A CAAAAAAA 
U .DUUUUUUU 




L 


A 'X'X'X'X'X'X'X'X 
\J .J J J J J J J J 


A 


Q 


\).o 1 z>\j\j\j\j\j 


c 

J 


44 


U.jOOOOOO / 





ZOj 




/ 






Q 

o 


1'+ OJJ 




9 


133496 


0.36787919 


10 


1334961 


0.36787946 


11 


14684570 


0.36787944 


12 


176214 841 


0.36787944 


13 


2290792932 


0.36787944 


14 


32071 101049 


0.36787944 


15 


481066515734 


0.36787944 



Table %A2 

D in) compared to n ! 



EXERCISES FOR SECTION 8.4 
A Exercises 

1. Solve the following recurrence relations. Indicate whether your solution is an improvement over iteration. 

(a) nSin) - Sin - 1) = 0, 5(0) = 1. 

(b) Tik) + ^kT{k - I) = 0,r(0) = 1. 

(c) U{k) - U{k - 1) = 0,k > 2, U(\) = 1. 

2. Prove that if n > 0, ln/2\ + [n/21 = n. (Hint: Consider the cases of n odd and n even separately.) 

B Exercises 

3. Solve as completely as possible: 

(a) T{n) = 3 + T {[n / 2]) , T (0) = 0. 

(b) Tin) = I + '-Tiln/2i), 7(0) = 2. 

(c) Vin) = 1 + VLn/8J),V(0) = 0. (Hint: Write n in octal form.) 

4. Prove by induction that if r(n) = 1 + T (Ln/2 J), T (0) = 0, and 2''-' < n < 2" , r > l,then7'(n) = r. (Hint: Prove by induction on r.) 

5. Use the substitution S (n) = r(n + 1) /r(n) to solve 

r(n) r(n - 2) - r(n)2 = 1 forn > 2,withr(0) = l,r(l) = 6,andr(n) > 0. 

6. Use the substitution G(n) = T(n)^ to solve 

r(n)^ - r(n - 1)2 = 1 for n > 1, with r(0) = 10. 

7. Solve as completely as possible: 

(a) e(n) = 1 + e([V^J),n > 2, gd) = 0. 

(b) R(n) = n +i?(Ln/2J),n > l,i?(0) = 0. 

8. Suppose Step 1 of the merge sort algorithm did take a significant amount of time. Assume it takes 0.1 time unit, independent of the value of 
n: 

(a) Write out a new recurrence relation for T (n) that takes this factor into account; 

(b) Solve for 7(2''), r > 0; 

(c) Assuming the solution for powers of 2 is a good estimate for all n, compare your result to the solution in the text. As gets large, is there 
really much difference? 
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I 8.5 Generating Functions 

This section contains an introduction to the topic of generating functions and how they are used to solve recurrence relations, among other 
problems. Methods that employ generating functions are based on the concept that you can take a problem involving sequences and translate it 
into a problem involving generating functions. Once you've solved the new problem, a translation back to sequences gives you a solution of the 
original problem. 

This section covers: 

(a) The definition of a generating function; 

(b) Solution of a recurrence relation using generating functions to identify the skills needed to use generating functions; 

(c) An introduction and/or review of the skills identified in point b; 

(d) Some applications of generating functions. 

Definition: Generating Function of a Sequence. The generating Junction of a sequence S with terms Sg, 81,82, ... is the infinite sum 

00 

G(S; z) = E S„ z" = 5o + Si z + S2Z-+SiZ^ + - 

n=0 

The domain and codomain of generating functions will not be of any concern to us since we will only be performing algebraic operations on 
them. 

Example 8.5.1. 

(a) If 5„ = 3",n > 0,then 

G(S;z) = l+ 3z + 9z^ + 21 z^ + ••• 
= Z3"z" 

n=0 

CO 

= ZOz)" 

n=0 

We can get a closed form expression for G {8; z) by observing that G{S,z) - 3 z G(S; z) = 1 . Therefore, 
G{8;z) =-V- 

l-3z 

(b) Finite sequences have generating functions. For example, the sequence of binomial coefficients C(n; 0), C(n; 1), ...,C(n; n), n > 1 
has generating function 

G(C(n; •); z) = C{n; 0) + C(n; l)z + ••• + Cin; n)z" 
= T,C(n-k)z'' 
= (1+Z)" 

by application of the binomial formula. 

(c) If e(n) = 

G(Q;z)= Y,n^z"= E V 

Note that the index that is used in the summation has no significance. Also, note that the lower limit of the summation could start at 1 since 
G(0)=0. 

SOLUTION OF A RECURRENCE RELATION USING GENERATING FUNCTIONS 

Problem: Solve ^(n) - 28{n - 1) - 3S(n - 2) = 0,n > 2, with S(0) = 3andS(l) = 1. 

(1) Translate the recurrence relation into an equation about generating functions. 

In our example, let V(n) = 8{n) - 28{n - V) - 3 8{n - 2),n > 2,withV(0) = and V(l) = 0. Therefore, 

00 

G{V\z) = + Oz + Z iS{n) - 28{n - 1) - 3S(n - 2))z" = 

CO 

(2) Solve for the generating function of the unknown sequence, G(8,z) = Z-SnZ". 

n=0 

00 

= Z i8{n) - 28{n - 1) - 38{n - 2))z» 

n=2 

= i; 5(«)z" - 2[ i; 5(« - l)z"] - 3 [ i; 5(« - 2)z"] 

n=2 \n=2 ) \n=2 ) 
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Close examination of the three sums above shows: 

00 oo 

(a) ZS„z"= ZSnf - S(0)-S(l)z 

n=2 n=0 

= G{S; z) - 3 - z 
smce5'(0) = 3and5'(l) = l. 

OO / 00 \ 

(b) Z5(n-l)z" = Z 

n=2 V„=2 / 

= z(s5(n)z"] 

= z(^Z5(n)z"-5(0)) 
= z(G(S; z) - 3 

OO / oo A 

(c) Z S(n - 2) z" = zH Z S(n - 2) f-^ 

n=2 \n=2 ) 

= ^ G{S; z) 

Therefore, 

(G(5; z) - 3 - z) - 2 z(G(5; z) - 3) - 3 z^ 0(5; z) = 
=> G(5;z)-2zG(S;z)-3z2G(S;z) = 3 - 5z 

=> G(S;z) = ■ 



l-2z-3z^ ' 

(3) Determine the sequence whose generating function is the one obtained in Step 2, 

For our example, we need to know one general fact about the closed form expression of an exponential sequence (a proof wiU be given later): 
T(n)=ba" , n>0 « G(T; z) = — (8.5a) 

J~az 

Now, in order to recognize S in our example, we must write our closed form expression for G (S; z) as a sum of terms like G (T; z) above. Note 
that the denominator of G (S; z) can be factored: 

i-2z-3z- (l-3;)(l+z) 

If you look at this last expression for G (5; z) closely, you can imagine how it could be the result of addition of two fractions, 

= ^ + ^ (8.5b) 

where A and B are two real numbers that must be determined. Starting on the right of 8.5b, it should be clear that the sum, for any A and B, 
would look like the left-hand side. The process of finding values of A and B that make 8.5b true is called the partial fractions decomposition of 
the left-hand side: 

A _^ fl _ A(l+z) , S(l-3z) 



l-3z 1+z (l-3z)(l+z) (l-3z)(l+z) 
_ (A+B)+(A-3B)z 
(1-3 z) (1+z) 

Therefore, 

I A+B = 3 \ /"^^H 
lA-3B = -5/ ^ \B = 2l 

and 

G(S; z)== --^+ 

1-3 z 1+z 

We can apply 8.5a to each term of G(S;z): 

is the generating function for Si(n) = 1 • 3" = 3" and 

2 

— is the generating function for S2(n) = 2 (-1)". 
Therefore, S(n) = 3" +2 (-1)". 
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From this example, we see that there are several skills that must be mastered in order to work with generating functions. You must be able to: 

(a) Manipulate summation expressions and their indices (in Step 2). 

(b) Solve algebraic equations and manipulate algebraic expressions, including partial function decompositions (Steps 2 and 3). 

(c) Identify sequences with their generating functions (Steps 1 and 3). 

We win concentrate on the last skill first, a proficiency in the other skills is a product of doing as many exercises and reading as many examples 
as possible. 

First, we must identify the operations on sequences and on generating functions. 

Operations on Sequences: Let 5 and T be sequences of numbers and let c be a real number. Define the sum S + T, the scalar product c S, the 
product Sr, the convolution S*T, the pop operation ST (read "S pop"), and the push operation 54- (read "S push") term- wise for ^ > by 

(S + T)(k) = S(k) + T(k) 
(cS)(k) = cS(k) 
(ST)(k) = S(k)T(k) 

(S*T)(k) = Y.S{j)T{k-j) 



iSt)ik) = S(k+1) and 
(5i)(^) = {_ 



ifk = 
.S(k-l) itk>0' 

If one imagines a sequence to be a matrix with one row and an infinite number of columns, S + T and c S are exactly as in matrix addition and 
scalar multiplication. There is no obvious similarity between the other operations and matrix operations. 

The pop and push operations can be understood by imagining a sequence to be an infinite stack of numbers with S (0) at the top, 5(1) next, etc., 
as in Figure 8.5.1a. The sequence 5t is obtained by "popping" S(0) from the stack, leaving a stack as in Figure 8.5.1b, with S(l) at the top, S(2) 
next, etc. The sequence S I is obtained by placing a zero at the top of the stack, resulting in a stack as in Figure 8.5.1c. Keep these figures in 
mind when we discuss the pop and push operations. 



SiO) 




5(1) 







S(l) 




S(2) 




SiO) 


S(2) 




S(3) 




S(l) 


S(3) 




S(4) 




S(2) 


S(4) 




S(5) 




5(3) 












(a) 




(c) 



FIGURE 8.5.1 

Stack interpretation of pop and push operation 

Example If 5 (n) = n,T(n) = rp-, U{n) = 2", and i?(n) =n2". 



(a) (S + T)(n) = n + 




(b) (U + R) (n) = 2" + n 2" 


= (1 + n) 2" 


(c) (2 60 (n) = 2-2"= 2"+i 




(d) C-R)(.n) = in2«=n2'- 


-1 


(e) (ST)(n) =nrp- = n^2 = 


n3 


(f) {S*T){n)= Y,S(j)T{n- 


;•)= Z7(n-y-)' 



J=0 j=0 

= Z(jn^-2nf + f) 



Zj-2n E/ + Er 

j=0 j=0 ;=0 



6 ' 4 

by Exercise 14 of Section 8.3. 



nHn+l) («-l) 
12 
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(g) (U*U)(n) = I,UU)U(n-J) 

J=o 

= (n+ 1)2" 

(h) (Sr)(n) = n + 1 

(i) (5i)(n) = max(0, «- 1) 
Q)((Sl)l)(n)= max(0, n - 2) 

2"-i ifn>0 
= 

a) aUl) t) (n) = (I/i) (n + 1) = 2" = Uin) 



X (2" ' ifn>( 
(k) (J/4.) (n) = 

^ rfn = ( 



f if n = 
m ((i/T)i)(n) = i „ , f 

I. U(n) if n > 

Notethat(i/4.)T + (i/t)4. 

Definition: Multiple Pop and Push: IfS is a sequence of numbers, define 

Sfp = (S-f(p - 7))-f ;/ p > 2 and St 1 = St. 
Similarly, define 

SiP = iS[{.p - l))iif p >2andSil =54.. 
Notice that 

(5T2)(it)= ((STl)T)(jfe) = ((ST)t)(fe)= (ST)(i(:+ 1)= i(: + 2 

In general, 

(S ^p)(k) = S{k + p), and 

iik<p 
S{k-p) itk>p 

Operations on Generating Functions 

If 

oo oo 

Giz) = E at and Hiz) = I,h^ 

k=0 k=0 

are generating functions and c is a real number, then the sum G + H, scalar product c G, product GH, and monomial product z'' G, p > 1 are 
generating functions, where 

oo 

(G + H)(,z)= Zia^ + b^)^ 

k=0 

00 

(cG)(z)= Zca^z* 

k=0 

oo 

(G /f) (z) = Z c z* where = Z 

k=0 J=o 

iz" G) (z) = z''la,z'=la, 2*+" = i; a„_p z" 

*:=0 *:=0 «=P 

The last sum is obtained by substituting n - p for ^ in the previous sum. 
Example 8.5.3. If 

00 oo 

D(z) = Z ^ z* andH(z) = Z 2*z* 

k=0 /fc=0 

then 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed imder a Creative Commons Attribution-Noncommercial-ShareAlike 3.0 United States License. 



164 



Chapter 8 - Recursion and Recurrence Relations 



(D + H)(.z) = E(fc + 2*)z* 

k=0 



{2H){z)= Z2-2*=z* 

k=0 



(zD)(z) = z Zitz*= Z*:z*""= 

k=0 k=0 k=\ 

DO 



■X, ( k 

(DH)(z)= S S7 2*-^' 

CO t k 

(.HH)(,z)= E E2'2*-' 



E(A;+l)2*z* 



Note: D(z) = G(5; z), and (z) = G(?7; z) from Example 8.5.2. Now we establish the connection between the operations on sequences and 
generating functions. Let S and Tbe sequences and let c be a real number; 

GiS + T;z) = GiS;z) + GiT;z) (8.5c) 

GicS;z) = cGiS;z) (8.5rf) 

G(S*T;z) = GiS;z)GiT-z) (8.5 

G(5T;z) = (G(5;z)-5(0))/z (8.5/) 

G(5-l;z) = zG(5;z) (8.5 g) 

In words, 8.5c says that the generating function of the sum of two sequences equals the sum of the generating functions of those sequences. 
Take the time to write out the other four identities in your own words. From the previous examples, these identities should be fairly obvious, 
with the possible exception of the last two. We will prove 8.5f as part of the next theorem and leave the proof of 8.5g to the interested reader. 
Note that there is no operation on generating functions that is related to sequence multiplication; that is, G (5 T; z) cannot be simplified. 

Theorem 8^.1. If p > 1, 



\GiS;z) - ZS(k)^ 
\ k=o 



z* 



(a) G(Stp;z) 

(b) GiSip;z) =zPG{S;z). 

Proof of Part (a): The proof is by induction. Basis: 



G(St, z) = E S(k + 1) z* = E S{k) z''-^ 

k=0 k=l 



= \ ZS(k)z' 



\S(0)+ ZS(k)z''-S(0) 

k=l J 

= (G(S;z)-Sm/z 
Therefore, part (a) is true for p = 1. 



Induction, Suppose that for some p> 1, the statement in part (a) is true: 

G(St(p + i);z)= G((5tp)T;z) 

= (G(5tp;z)-(5tp)(0))/z bythebasis 



-S{p) 



by the induction hypothesis. Now write S(j>) in the last expression above as (S(r) z'')lz^ so that it fits into the finite summation: 



G(St(p + l);z) = 



= [g(S;z)- E 

V t=0 



S{)Oz'\ z 



k / .p+i 
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Therefore the statement is true f or p + 1 . ■ 



CLOSED FORM EXPRESSIONS FOR GENERATING FUNCTIONS 

The most basic tool used to express generating functions in closed form is the closed form expression for the geometric series, which is an 
expression of the form a + ar + ar^ + -".It can either be terminated or extended infinitely. 

Finite Geometric Series: a + ar + ar^ + ■■■ + ar" = a| ' j (8.5h) 

Infinite Geometric Series: a + ar + ar^ + ■■■ = (8.5i) 
Restrictions: a and r represent constants and the right sides of the two equations apply under the following conditions: 

(1) r must not equal lin the finite case. Note that a + ar + •••ar" = (n+ l)aif r = 1. 

(2) In the infinite case, the absolute value of r must be less than 1 . 

These restrictions don't come into play with generating functions. We could derive 8.5h by noting that if S(n) = a + ar + ••• + ar",n > 0, 
then S (n) = r S (n - 1) + a (See Exercise 10 of Section 8.3). An alternative derivation was used in Section 8.4. We wiU take the same steps 
to derive 8. 5i. Let 

X = a + ar + ar^ + ■■■ 
rx = ar + a r^ + ■■■ 

X - r X = a 

Therefore, X = — . 

l-r 

Example 8.5.4. 

(a) lf5(n) = 9-5", n > 0, G (5; z) is an infinite geometric series with a = 9andr = 5z. Therefore, 

G(5;z)=^. 

(b) If T(n) = 4,n > 0,then G(T; z) = 4/(1 - z). 

(c) If i/(n) = 3(-l)",thenG(I/;z) = 3/(1 + z). 

(d) LetC(n) = S(n) + T(n) + U(n) = 9 • 5 + 4 + 3(-l)''. Then 

G(C;z) = G(5;z) + C(r;z) + G(i/; z) 

_ 9 4 3 

~ 1-5 z 1-z 1+z 
_ 14z^+34z-16 

Given a choice between the last form of C (C; z) and the previous sura of three fractions, we would prefer leaving it as a sura of three 
functions. As we saw in an earlier exaraple, a partial fractions decoraposition of a fraction such as the last expression requires sorae effort to 
produce. 

17 

(e) If G(Q; z) = 34/(2 - 3z), then Q can be determined by multiplying the numerator and denominator by 1/2 to obtain — —. We recog- 

3 

nize this fr-action as the sum of the infinite geometric series with a = 17 and r = - z. Therefore Q(n) = 17 (3/2)". 

(f) If C(A;z) = (1 + zf , then we expand (1 + zf to / + 3z + 3 z^ + z^ . Therefore A(0) = 1, A(l) = 3 A(2) = 3, A(3) = 1, and, 
since there are no higher-powered terms, A (n) = 0, n > 4. A more concise way of describing A is A (k) = C(3; k), since C(n; k) is usually 
interpreted as of ^ > n. 

Table 8.5.1 contains some closed form expressions for the generating functions of some common sequences. 



Sequence Generating Function 

S(k) = b(^ G(S;z)= — 

1-az 

S{k) = k G(S;z) = ^ 

abz 



S{k) = bka' G(S;z): 

(l-oz) 

SUk) = ^ G(5; z) = 

|C(n.) O.k.n ^s;z) = a^zr 
l k> n 

Table SS.l 
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Closed Fomi Expressions of some Generating Functions 

Example SSS. Solve S(k) + 3S(k - I) - 4S(_k -2) = 0,k>2, with S(P) = 3 and S (1) = -2. The solution wiU be obtained using 
the same steps that were used earlier in this section, with one variation. 

(1) Translate to an equation about generating functions. First, we change the index of the recurrence relation by substituting n + 2 for k. The 
result is 5(n + 2) + 35(n + 1) - 45(n) = 0, n > 0. Now, if V(n) = S(n + 2) + 35 (n + 1) - 4S(n), then V is the zero sequence, 
which has a zero generating function. Furthermore, V = 5't2 + 3(5't)-45'. Therefore, 

= G(y;z) 

= G(5T2; z) + 3G(5t;z) - 4G(S;z) 

G(fe)-.S(0)-S(l)z _^ ^ iG(Sx)-Sm _ 4 ^) 
^ z ' 

(2) We want to now solve the following equation for G (S; z): 

G{S\z)-S{0)-Smz ^ (.G{S;z)-Sm _ ^ ^ q 

z^ z ' 

Multiply by z^ : 

G{S;z) - 3 + 2z + 3z(G(5';z) - 3) - 4z^G(5;z) = 
Expand and collect all terms involving G{S; z) on one side of the equation: 

G{S;z) + 3zG(S;z) - 4^G{S;z) = 3 + 7z 
(1 + 3z - 4z^)G(S;z)= 3 + 7z 
Therefore, 

(3) Determine S from its generating function. 

1 + 3z - 4z== = (1 + 4z)(l - z) 
thus a partial fraction decomposition of G (S; z) would be: 



A _B Az-A-4Bz-B 

l+4z 1-z ~ (z-l)(4z+l) 

_ (A+B)+(4B-A)z 
fe-l)(4z+l) 

Therefore, A + B = 3 and 4 B - A = 7 . The solution of this set of equations is A = 1 andB = 2. 

2) = T^- + 7" 

l+4z 1-z 

is the generating function of 5i(n) = (-4)", and 

2 

- — is the generating function of = 2(1)" = 2. 
In conclusion, since GCSiz) = G(Si;z) + G{S2;z),S{n) = 2 + (-4)". 

Example 8^.6. Let A = [a, b, c, d, e] and let A* be the set of all strings of length zero or more that can be made using each of the 
elements of A zero or more times. By the generaUzed rule of products, there are 5" such strings that have length n,n>0. Suppose that X„ is 
the set of strings of length n with the property that all of the a's and b's precede all of the c's, <fs, and e's. Thus aaabde e Xg, but 
abcabc ^ Xj. Let R(n) = \X„\. A closed form expression for if can be obtained by recognizing R as the convolution of two sequences. To 
illustrate our point, we will consider the calculation of R (6). 

Note that if a string belongs to Xg, it starts with k characters from [a, b] and is followed by 6 - ^ characters from {c, d, e). Let S (k) be the 
number of strings of a's and fo's with length k and let T (k) be the number of strings of c's, rf's, and e's with length k. By the generalized rule of 

products, S (k) = 2* and T (k) = 3*. Among the strings in X(, are the ones that start with two a's and fo's and end with c's, d's, and e's. There 
are S (2) T (4) such strings. By the law of addition, | X^ | = R (6) = 5 (0) T (6) + S(l) 7(5) + ■■■ + S(5) 7(1) + S (6) 7(0). Note that the sixth 
term of R is the sixth term of the convolution of S with 7, S*7. Think about the general situation for a while and it should be clear that 
R =S*7. Now, our course of action will be to: 

(a) Determine the generating functions of S and 7, 

(b) Multiply G (5; z) and G (7; z) to obtain G (S * T; z) = G (R; z) (by 10.5e), and 

(c) Determine if on the basis of G (if ; z) . 



oo . oo . 

(a) G(S; z) = Z 2*z* = tt - and G(.7; z) = Z 3*z* = — 

k=o ^ k=o 

(b) G(R;z) = G(S;z)G(7;z) = ' 



(l-2z)(l-3z) 
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(c) To recognize R from G (R; z), we must do a partial fractions decomposition: 

1 A_ _^ B _ -3Az+A-2Bz+B _ {A+B)+{-3A-2B)z 

(l-2z)(l-3z) ~ l-2z l-3z ~ (2z-l)(3z-l) ~ (2z-l)(3z-l) 

Therefore, A + B = land-3A - 2B = 0. The solution of this pair of equations is A = -2andB = 3. 
Since 

-2 3 

G(R;z) =-^+-^, 

l-2z l-3z 

which is the sum of the generating functions of -2 (2)* and 3 (3)*, 

Rik) = -2 (2)* + 3 (3)* = 3*+' - 2*+i 
For example, (6) = 3^-2' = 2187 - 128 = 2059. Naturally, this equals the sum that we get from (S*T)(6). To put this number in 
perspective, the total number of strings of length 6 with no restrictions is 5^ = 15625, and -77^ « 0.131776. Therefore approximately 13% 

15625 

of the strings of length 6 satisfy the conditions of the problem. 
EXTRA FOR EXPERTS 

The remainder of this section is intended for readers who have had, or who intend to take, a course in combinatorics. We do not advise that it be 
included in a typical course. The method that was used in Example 8.5.6 is a very powerful one and can be used to solve many problems in 
combinatorics. We close this section with a general description of the problems that can be solved in this way, followed by some examples. 

Consider the situation in which Pj, P2, Pm aie m actions that must be taken, each of which results in a well-defined outcome. For 
each A: = 1,2, . . . , m define X/^ to be the set of possible outcomes of P/^ . We wiU assume that each outcome can be quantified in some way 
and that the quantification of the elements of Xj^ is defined by the function : {0, 1,2, . . .}. Thus, each outcome has a non-negative 

integer associated with it. Finally, define a frequency function F;^ : {0, 1, 2, . . .} -» {0, 1, 2, . . .} such that Fi^(_n) is the number of elements 
of that have a quantification of n. 

Now, based on these assumptions, we can define the problems that can be solved. If a process P is defined as a sequence of actions 
Pi, P2, Pm as above, and if the outcome of P, which would be an element of Xi XX2X ■■■ xX„, is quantified by 

m 

2(01,02, a„)= ZQkiCk)' 

k=l 

then the frequency function, F, for P is the convolution of the frequency functions for Pi, P2, Pm, which has a generating function equal to 
the product of the generating functions of the frequency functions Fi, F2 F„. That is, 

G(F; z) = G(Fi; z) G(F2; z) -GiF^; z) (8.5j) 

Example 8.5.7. Suppose that you roll a die two times and add up the numbers on the top face for each roll. Since the faces on the die represent 
the integers 1 through 6, the sum must be between 2 and 12. How many ways can any one of these sums be obtained? Obviously, 2 can be 
obtained only one way, with two I's. There are two sequences that yield a sum of 3: 1-2 and 2-1 . To obtain all of the frequencies with which the 
numbers 2 through 12 can be obtained, we set up the situation as follows. For ; = 1, 2; Pj is the rolling of the die for the ;* time. 
Xj = {1, 2, 6) and Qj : Xj— > {0, 1, 2, 3, ...) is defined by Qj(x) = x. Since each number appears on a die exactly once, the 
frequency function is Fj0c) = lifl < k < 6, and Fj(k) = otherwise. The process of rolling the die two times is quantified by adding up the 
Qj' s; that is, Q(ai, 02) = Qi (ai) + 62(02) • The generating function for the frequency function of rolling the die two times is then 

G(F; z) = G(Fi;z)G(F2;z) 

= [z''+z' + z'' + z^+z^+zf 

= {z^^ + 2z^^ +3z^° + 4z^ + 5^ +6z'' + 5z^ +4z' + 3z'^ + 2z^ + z^) 

Now, to get F(k),just read the coefficient of z*. For example, the coefficient of z' is 4, so there are four ways to roll a total of 5 . 

To apply this method, the crucial step is to decompose a large process in the proper way so that it fits into the general situation that we've 
described. 

Example 8.5.8. Suppose that an organization is divided into three geographic sections, A, B, and C. Suppose that an executive committee of 11 
members must be selected so that no more than 5 members from any one section are on the committee and that Sections A, B, and C must have 
minimums of 3, 2, and 2 members, respectively, on the committee. Looking only at the number of members from each section on the commit- 
tee, how many ways can the committee be made up? One example of a valid committee would be 4 A's, 4 B's, and 3 C's. 

Let Pa be the action of deciding how many members (not who) from Section A will serve on the committee. Xa = {3, 4, 5) and QA(k) = k. 

The frequency function, Fa , is defined by Fa(^) = 1 if ^ e X^- , with Fa(^) = otherwise. G(Fa; z) is then z^ + z* + z^ . Similarly, 

G(Fb;z) =z^+ z'+ z'* + z'= GiFc;z). Since the committee must have 11 members, our answer wiU be the coefficient of z'' in 
G(Fa; z) G(Fb; z) G(Fc; z), which is 10: 
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G{Fa; z) G{Fb; z) G{Fc; z) = [z^ + + z') [z^ + z^ + z^ + z') 

= z" + 3 z"* + 6 z" + 9 z'2 + 10 z" + 9 z'" + 6 z' + 3 z« + z^ 

EXERCISES FOR SECTION 8.5 
A Exercises 

1 . What sequences have the following generating functions? 

(a) 1 

(b) f 

(c) 1 + z 

(d) — + — 

^ ' l+2z l-3z 

2. What sequences have the following generating functions? 

(a) 

(c) - + - 

^ ' l-z 1+z 

B Exercises 

3. Find closed form expressions for the generating functions of the following sequences: 

(a) V(n) = 9" 

(b) P.wherePW - 6P{k - 1) + 5P{k - 2) = Oforyt > 2,withP(0) = 2andP(l) = 2. 

(c) The Fibonacci sequence: F (A: + 2) = F{k + V) + F{k),k > 0,withF(0) = F(l) = 1. 

4. Find closed form expressions for the generating functions of the following sequences: 

(a) W in) = C(5; n) 2" for < n < 5 and W (n) = for n > 5. 

(b) 2, where 2 (/t) + Q{k - \) - A2Q{k - 2) = for /t > 2, with 2 (0) = 2 and 2(1) = 2. 

(c) G,whereG(/(: + 3) = G {k + 2) + G (k + 1) + Gik)ioik > 0,withC(0) = C(l) = G (2) = 1. 

5. For each of the following expressions, find the partial fraction decomposition and identify the sequence having the expression as a generat- 
ing function. 

, , 5+2z 
(b) ''-''^ 



(C) 



2-3 z+z^ 
6-29 z 



l-llz+30z^ 

6. Find the partial fraction decompositions and identify the sequence having the following expressions: 
(a) 

^ ' l-9z^ 

(b) 
(c) 



16-8 z+z' 
2z 



l-6z-7z^ 

7. Given that S(k) = k and T (k) = 10 k, what is the A:* term of the generating function of each of the following sequences: 

(a) S + T 

(b) St *T 

(c) S*T 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed imder a Creative Commons Attribution-Noncommercial-ShareAlike 3.0 United States License. 



Chapter 8 - Recursion and Recurrence Relations 



169 



(d) St*St 

8. Given that P(k) = C (10; k) and Q (k) = A: !, what is the tem of the generating function of each of the following sequences: 

(a) P *P 

(b) P + Pt 

(c) P*Q 

(d) 2*2 

C Exercises 

9. A game is played by rolling a die five times. For the fc* roll, one point is added to your score if you roll a number higher than k. Otherwise, 
yoiu" score is zero for that roll. For example, the sequence of rolls 2, 3, 4, 1,2 gives you a total score of three; while a sequence of 1,2,3,4,5 

gives you a score of zero. Of the 6^ = 7776 possible sequences of roUs, how many give you a score of zero?, of one? ... of five? 

10. Suppose that you roU a die ten times in a row and record the square of each number that you roU. How many ways could the sum of the 
squares of your roUs equal 40? What is the most common outcome? 
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I 8.6 Recursion and Computer Algebra Systems 

There is frequently debate as to if, how, and when computers should be introduced teaching a topic such as recurrence relations. We have 
chosen to append this information at the end of the chapter. If desired, instructors can intersperse the following subsections with the previous 
sections, cover this section a the end, or simply ignore this section. Motivated students are welcome to read it at any time, but should be 
warned that many instructors will still prefer that they do some of the calculations "by hand" within their courses. 

Note: At the present time, this section only describes the topics in Mathematica. If any readers with a knowledge of Sage or any other com- 
puter algebra system would like to contribute to this section, contact us through the addresses that appear at http://faculty.uml.edu/kl- 
evasseur/ads2/. 

Recursive Definitions (8.1 and 8.2) 
Mathematica 

Functions can be defined recursively in Mathematica. For example, the Fibonacci sequence, 1,1,2,3,5,8,13,... (each number is the sum of 
the previous two) can be defined with the following input. 

fib[0] = 1; 
fib[l] = 1; 

fib[i_ /; i > 1] := fib[i - 2] + fib[i - 1] 

The left side of the third line, fib [ i_/ ; i>l] indicates that fib is being defined in this case for a "pattern" called i satisfying the condition 
that i is greater than 1 . Any "input" that fits this pattern will have this rule applied to it. Notice that the definitions for f ib [ ] and f ib [ 1 ] 
coexist with this general rule and define fib for two numbers that don't fit the pattern described in the general rule. 

With the input above evaluated, we can determine the 12th Fibonacci number. 
fib[12] 

233 

This particular definition is not efficient because previously computed values of fib are not saved. As a result, the time it takes to compute 
f ib [ i ] is roughly proportional to the value of f ib [ i ] , which grows exponentially. This can be demonstrated with the following bit of code. 

fp = ListPlot[ ({ttl, fib[ttl] } &) /@Range[3, 21, 3] , 

Joined -» True, PlotLabel ^ "Plot of fib[i] "] ; 
tfp = ListPlot[ ({Bl, First [Timing [fib [Bl] ] ] } &) /@Range[3, 21, 3] , 

Joined -» True, PlotLabel ^ "Timing Plot for computing fib[i]"]; 
Show [GraphicsRow [{ fp, tfp} , ImageSize ^ 1 . 5 {288 , 84.75"}]] 

Plot of fib[i] Timing Plot for computing fib[i] 



15000 : 
10000 : 

5000 - 




10 15 20 10 15 20 

Taking over three seconds to compute the 30"" Fibonacci number is unacceptable. 
{Timing [fib [30] ] , $System} 

(13.32795, 1 346269), Mac OS X x86 (64-bit)) 

The timing result above depends on the system you use, so you might want to execute this expression for comparison. 

How to get around the timing problem. You could never get a value like fib [100] using the recursive definition because of the obvious 
time growth. To get around the problem, you can define a variation of the function that saves it's results. 

The following definition allows for saving the results that are computed. Notice that the SetDelay ( ; =) involves a Set (=). 

fib2 [0] =1; 
fib2 [1] = 1; 

fib2 [i_ /; i > 1] : = fib2 [i] = fib2 [i - 2] + fib2 [i - 1] 

Now let's compute the 30th number in the sequence with this new definition 
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Timing [fib2 [30] ] 

(0.000317, 13462691 

Now that the value of f ib[ 30] has been computed, it's value is returned almost instantly 

Timing [fib2 [30] ] 

{0.000012, 1346269) 
Now you can see what f ib2 [100] equals in a reasonable time. 

Timing [fib2 [100] ] 

(0.000499, 573 147 844013 817084 101) 

There is still a problem with using recursion. If we were to ask for the 500* Fibonacci number at this point, Mathematica would have difficulty 
because it still had 400 levels of recursion to negotiate. For this reason, getting to the 500* Fibonacci number is best reached by evaluating 
every 50* or so of the numbers instead of directly asking for the 500* 

Do[fib2 [50 k] , {k, 3, 10}]; 

fib2 [500] 

225 591 516 161 936330 872512695036 072072046011 324913 758 190588 638 866418474627 738 686 883 405015987 052 796-. 
968498626 

Solution of Recurrence Relations (8.3 and 8.4) 
Mathematica 

Mathematica has a function called RSolve that will solve some recurrence relations, including linear recurrence relations with constant 
coefficients. In addition, it can handle some systems of recurrence relations, which we will discuss in a Chapter 12. 

? RSolve 



KSo\\ie[eqn, a[n], n] solves a recurrence equation for aln]. 

RSolveKc^ni, eqrij, ...), a2[«], - I, «] solves a system of recurrence equations. 
RSolve[e^«, a[n,, n2, ...], In,, tii, ...|] solves a partial recurrence equation. » 

Example. Consider the sequence 5 defined as follows. 

S{k)= \3S{k-\) -0.7 with 5(0)= 3 

We can get the general solution to the recursive part of the definition, if desired. 

recurence = S [k] == 1 . 3 S [k - 1 ] - 0.7; 
RSolve [recur ence, S[k], k] 

[[S{k) ^ 1.3*-' (2.33333 y 2.7i828°-2''2-^''*-°^''"''" - 3.03333) + c, 1.3*-' )) 
Or we can add the initial condition to the first argument of RSolve and get a unique solution 
solution = RSolve [ {recurence, S[0] == 3}, S[k], k] 

{{5(/(:)^ 0.769231 x 2.71828-''^''2''^" (3.03333 \ 1.3*' + 0.866667 x 1.3*'2.71828°'«2364tj|| 
Notice that since approximate numbers 1.3 and 0.7 are involved in the definition of 5, the solution involves approximate numbers. If we had 

13 7 

used — and —, respectively, the solution would be exact. 
10 10 ^ •' 

Next we assemble a list of ordered pairs based on the solution. 
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points = Table [ {k, S[k]} /. First [solution] , {k, 0, 8}] 



fO 


3. ^ 


1 


3.2 


2 


3.46 


3 


3.798 


4 


4.2374 


5 


4.80862 


6 


5.55121 


7 


6.51657 


^8 


7.77154 ; 



Here is a plot of the first few terras of the solution. 

ListPlot [points, PlotStyle ^ {Red, PointSize [0. 03 ] } , AxesOrigin -» {0, 0}] 



2 4 6 8 

Example. Here is the solution to a second order linear recurrence relations with constant coefficients. Initial conditions included. Notice that 
the exact data in the systera produces exact expressions. 

RSolve[{T[k] - 6 T[k - 1] + 3 T[k - 2] == 2"k, T[0] == 2, T[l] == 3}, T[k], k] 



{{rw ^ -|^5 X 3'-* 1^-19 ^[2 3*+? ^3 - ^f6^^ - 28 x 3*+' ^3 - + 19 yj^ 3*^1 ^3 + ^f6 

28 X 3*+' (3 + ^f6 J + 3 x 2"^* ((3 - ^/e") (3 + V^"))! /(4 (2 ^ - 3)' (3 + 2 ^/e" 



Example. Here is the solution to a recurrence relation sirailar to the ones described in Section 8.4. The solution is in terras of the Gararaa 
function, which is a generalization of the factorial function. 

Usol = U[k] /. First [RSolve[{u[k] - kU[k - 1] == 2"'', U[0] == l}, U[k] , k] ] 

1- 



k + l, 



Here is a table of the first few terras of the solution. 
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Table [{j, N[Usol / . {k^ j}]}, {j, 0, 10}] 



( ^ 


1. \ 


1 


1.5 


2 


3.25 


3 


9.875 


4 


39.5625 


5 


197.844 


6 


1187.08 


7 


8309.55 


8 


66476.4 


9 


598288. 


.10 


5.98288 xlO« , 



Example. Here is the solution to the recurrence for the number of derangements of a set of n elements, £)(n). We use D instead of D here 
because Mathematica reserves the name D for the derivative function. 

derangementSol = RSolve [ {S [n] == (n - 1) ( © [n - 1 ] + © [n - 2] ) , 2) [1] == 0, ©[2] == 1} , ©[n] , n] 

((£)(«) ^ (r(2, -1) Y(n + 1) - r(n + 1, -l))/(2r(2, -1) - r(3, -1)))) 

Table[{n, ©[n]} /. First [derangementSol] /. {n^k}, {k, 1, 10}] //Round 



( 1 


^ 


2 


1 


3 


2 


4 


9 


5 


44 


6 


265 


7 


1854 


8 


14833 


9 


133496 


, 10 


1334961 , 



Generating Functions (8.5) 

The examples in Section 8.5 solving recurrence relations with generating functions were selected to keep the algebra reasonably neat. Here we 
will step through the solution of a second order linear recurrence relation with constant coefficients for which the numbers are not so nice. 
Using Mathematica, the messy work isn't a problem. 

Mathematica 

Consider the recurrence relation S{k) = -S{k - 1 ) + 5 S{k - 2) , with S (0) = 4 and S ( 1 ) = 3 . In order to check our computations , lets define S 
recursively and get its first few terms: 

S[0] = 4; S[l] = 3; 

S[k_] : = S[k] = -S[k-1] +5S[k-2] 

Map[S, Range [0, 10]] 

{4, 3, 17, -2, 87, -97, 532, -1017, 3677, -8762, 27 147} 

We can find the generating function for S by observing that 5 = -5T +5 5"r2 . Using the formulas for the generating functions of ST and 
5t2 that were derived in Section 8.5, we get the following equation which we can solve for G = G(S; z): 
Clear [6] 

(G-S[0] -S[l] z) (G-S[0]) 

gfEquation = + 5 G == 

z 

G-3z-4 G-4 
+ 5G = 

z 

We solve for G and since there is only one solution, we use First to extract that single solution from the output of Solve. 
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6sol = 6 / . First [Solve [gf Equation, 6]] 

-7z-4 
Sz^-z-l 

At this stage, we can extract a finite number of coefficients from the generating function. We do so here, but it only verifies the list of terms we 
computed above. We stiU don't have a closed form expression for S. 

serlesExpanslon = Series [6sol, {z, 0, 10}] 

4 + 3 z + 17 - 2 + 87 z'' - 97 + 532 z« - 1017 z' + 3677 z* - 8762 z' + 27 147 z'" + 0(z") 
CoefflclentLlst [serlesExpanslon, z] 

{4, 3, 17, -2, 87, -97, 532, -1017, 3677, -8762, 27 147} 

In order to get a close form expression we need to get partial fractions decomposition of the generating function. The function Apart is 
meant to do this, but the next result isn't very encouraging 

Apart [6sol] 

-7z-4 
Sz'-z-l 

The difficulty is that Apart uses the Factor function that only factors polynomials over into factors with rational coefficients. 
Factor [6sol] 

7z + 4 

5z^-z-l 

In order for factor to work on this denominator, we need to tell Mathematica what square root should be included. The discriminant of the 
denominator is 

Discriminant [5 z^ - z - 1, z] 

21 

Therefore we need to extend the rational numbers to include We win discuss extensions in a more formal setting in Chapter 16. 

Factor[6sol, Extensions {V2l]-] 

(20 (7 z + 4))y^^^-10 z + V21~ + l) (lO z + V^l" - l)] 
With this factored expression, Apart will do its job. 

PF = Apart [Factor [Gsol, Extension {Sqrt[21] }] ] 

47-7 V2r 47 + 7 ^/W 
+ ——^ 

V2r(ioz+ V2r- 1) V2r(-ioz+ V2r+ ij 

We wiU work with the two terms above, both of which is the generating function of a geometric sequence, individually to get an expression that 

look like 610/+ 62 02*- The problem is to find the hi' s, which are initial terms of the generating functions and the a, ' s with are the ratios of 
successive coefficients of the generating functions. 

6S1 = PF[ [1] ] 

47 + 7V2r 

V2r(-ioz+ V2r+ ij 

Why isn't the first term that appears in the output for PF the same as the first part that we called 6S1? It has to do with output forms in 
Mathematica and let's not get into that right now. This all works out, believe me! 

A quick word on the output from Series: If the series expansion up to degree n for a function /(z) centered at is + z + ■ • • + a„ z", then 
the internal structure of the output from Series is a SerlesData expression. The variable and center of the series expansion are the first 
two arguments of this expression, and the third is the list of coefficients, which is what we wiU be extracting here. 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed imder a Creative Commons Attribution-Noncommercial-ShareAlike 3.0 United States License. 



Chapter 8 - Recursion and Recurrence Relations 



175 



InputForm [series [e^'^, {z, 0, 4}]] 

SeriesData[z, 0, {1, 2, 2, 4/3, 2/3}, 0, 5, 1] 

The last three argument of the output indicate the powers of the variable to attach to the variable, through 5 in steps of 1 in the example 
above. The first part of the third argument is the first term of the first geometric sequence. 

bi = Series[GSl, {z, 0, 5}] [ [3, 1] ] 

147 + 47 ^f2^ 



To extract the quotient of the second with the first arguments we use the pure function (tt[[3, 2]]/tt[[3, 1]]) &. This is the common ration of 
the first geometric sequence. 

ai = Series [6S1, {z, 0, 5}] // (tt[[3, 2]] /tt[[3, 1]]) & 

10 

1 + ^/2^ 

We repeat the same calculations for the second geometric sequence. 
GS2 = PF[ [2] ] 

47-7V2r 

V2r(ioz+ V2r- 1) 

b2 = Series[6S2, {z, 0, 5}] [ [3, 1] ] 

47 ^f2^ - 147 



32 = Series [6S2, {z, 0, 5}] // (tt[[3, 2]] /tt[[3, 1]]) & 

10 

V2r- 1 

We collect the four constants we've just computed into the sum of two geometric sequences. 
Ssol [k_] : = bi ai'' + b2 32"^ 

Next we test our expression to see if it matches the two other lists of terms if S that we've computed. The result may be disturbing, but the 
reason for the difference is that Mathematica hold off simplifying some expressions until requested. This is the case here. 
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Map[Ssol, Range [10]] 

10 (l47 + 47 V2r) 10 (47 ^/2^ - 147) 



2l(l+V2r)^ 2l(V2r-l)^ 

100 (47 V2r - 147) 100(l47 + 47V2r) 1000 (l47 + 47 V2r) 1000 (47 V2r - w) 

+ , , 

21 (V2r - 1)^ 21 (1 + V2r)' 21 (1 + ^21^* 21 {^f2^ - 1)* 

10000 (47 V2r- 147) 10000 (147 + 47 V2r) 100000 ( 147 + 47 V2r) 100000 (47 V2r - 147) 

+ , , 

2i(V2r-i)' 2i(i + V2r)' 2i(i + V2r)'' 2i(V2r-i)'' 

1000000(47V2r- 147) 

2i(V2r- 1)^ 

10000000 (147 + 47 V2r] 



■ ^1 000000 ^147 + 47 V^)) / (^^ + V^) 
- ^10000000 ^47 yfll - 147 jj ^ |^21 ^^21" - 1 



21(1 +V2r) 

(100000000 (47 yfll - 147 jj I ^21 ^-yjll - ij'j + (100000000 (147 + 47 /(^^ 

(1 000000000 (147 + 47 V^^)) / (1 + V^^) ) " (1 000000000 ^47 ^21" - 147 jj ^ (2I [^[ll - ij j, 

(10000000000 (47 ^[21 - 147jj I (21 - ij" j + (10000000000 (147 + 47 V^l")) / (21 (1 + V^l")"]} 

The simplified result should look familiar. 

Map[Ssol, Range[0, 10]] // Simplify 

{4, 3, 17, -2, 87, -97, 532, -1017, 3677, -8762, 27 147} 
Finally, the closed form expression for our sequence can be examined. 
Ssol [k] 

^ (47 ^[21 - 147j (-10)* [^[21 - ij + ^ (l47 + 47 yfll^ 10* (l + yfll^ * 

Exercises for Section 8.6 
A Exercises 

1 . If B(0) = 1000, and B(n) = 1 .05'/" B(n - 1) + 1000 for « > 1 , compute B(20) and plot the values of B for < n < 20. 

2. Compute the a closed form expression for F(n), where F(0) = 1 , F(l) = 2, F(2) = 2.5, and if A: > 3, 

Fik) = 0.9 F(/t - 1) + 0.52 Fik - 2) - 0.42 F(k - 3). 
As k gets large, what does F{k) tend toward? 

3. Use generating functions to find a closed form expression for 

WiO) = l, M^(l) = l 

W{n) = 3 W(n - 1) + W(n - 2) whenn > 2 
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I SUPPLEMENTARY EXERCISES FOR CHAPTER 8 

Section 8.1 

1 . Write out a recurrence relation to describe the number of n digit positive integers (in decimal form) that contain no repeating digits. 

2. The number of k subset partitions of an n-element set, S (n, k), satisfies the recurrence relation S(n, k) = kS(n — l,k) + S(n — l,k — I) for 
n > 3 and 2 <k <n. The numbers S (n, k) are called Stirling Numbers of the Second Kind. 

(a) What are S(n, 1) and 5 (n, n) for n > 1? 

(b) Compute 5 (5 , 2) , 5(5 , 3) , and 5 (5 , 4) . 

(c) How many partitions are there of a five-element set? 

n 

(d) What is the significance of J] S(n, k)l 

k=] 

3. Consider the following algorithm, called Split: 

Input: a list, Lin = (a], a2, ■■■ , a„) of n numbers, where n is a natural number 
Output: a list. Lout = (bi, b2 ■■■ , b„) of n numbers If n < 2, then Lout := Lin 
else {n > 2} 

1. Let L 1 := (ai, 03, 05 ...) andL2 := (02, 04, ag, ...) 

2. Execute Split with input L 1 and output L 1 out 

3 . Execute Split with input L2 and output L 2 out 

4. Lout := (L 1 out, L 2 out), that is, the list obtained by copying L 1 out and then copying L 2 out 

(a) What is the output from Split if the input is (1, 2, 3, 4)? 

(b) What is the output from Split if the input is (0, 1,2, ... , n - 1), where n is equal to 2r for some natural number r? Hint: the 
general answer is related to the binary representations of the numbers in the Ust. 

Section 8.2 

4. Prove by induction that S(k) = (2 + k) 10* is a solution of the recurrence relation S(k) = 20 S(k - 1) - 100 S(k - 2); S(0) = 2, S(l) = 30. 

5. Explain why B{n) = 1 + 2 + • • • + 2" is not a closed form expression. 

6. Between them, Abe and Zeke have n coins. They each flip one coin and if they match (both heads or both tails), then Abe keeps both coins. 
Otherwise, Zeke keeps both coins. They continue as long as both have at least a coin. Let L (njc) be the expected number of flips that take place 
in the game if Abe starts with k coins. IfO < k < n, then 

L(n, k) = l+ {Un, k-\)+ Un, k + l))/2 

(a) Justify this equation. 

(b) What are L{n, 0) and L{n, n)? 

(c) Tabulate L(n, k) for n = 2, 3, and 4. 

Section 8.3 

7. On her nth birthday, Kathryn receives n dollars from her Uncle Dave and deposits it into a special account that pays interest of 10 % each 
year. On her 21st birthday, how much does she have in the account? 

8. Suppose you borrowed $4,000 at 8% interest and you made payments of $250 per month. Let D(n) be your debt n months after taking out 
the loan. Then Z)(0) = 4000 and D{n) = 1 .08 D(n - 1) - 250 for n > 0. 

(a) Derive a closed form expression for D (n). 

(b) Estimate how long it would take you to pay off the loan. Use logs if necessary. A rough estimate is sufficient. 

9. (Tower of Hanoi Problem): A classic problem that can be solved using material of this section is the Tower of Hanoi puzzle. Assume that we 
have a board with three pegs mounted on it and on one of the pegs n circular disks of decreasing size (smallest on top). The problem is to 

determine how many moves it takes to move the n disks from one peg to another without placing a larger disk on a smaller one at any time. 

(a) If X(n) stands for the number of moves it takes to move n disks from one peg to another, then X(n) = 2 X(n - 1) + 1 . Justify this recurrence 
relation. 

(b) Solve the recurrence relation in part a. Verify your result, and the formula in part a, using three disks. 
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Section 8.4 

10. Suppose that T(0) = and T(a) = 1 + r(La/2J) for a > 0. If a = (1 101001 Oil Oil nO)^„ , what is T(a)l 

11. Find a solution forn = 2* : g(n) = n + e(Ln = 4J), n >l,e(0) = 0. 

12. The recurrence relation x(n + 1) = 1 + x(n)^ with x(0) = 1 has a solution that is approximately equal to c'*"' , where c is a constant and 

t(n) = 2", Estimate the value of c based on values of x(0), x(l) , ... , x(4). Test your answer by computing x(5) and comparing it to c'*'' . The 
significance of x (n) is that it equals the number of binary trees of depth n or less (see Chapter 10). 

Section 8.5 

13. Write out the first five terms of the generating function of sequence S where 5(0) = 1 and for ^ > 0, S(k,) = 5(0) + • • • + S(k - 1). 

14. (a) Determine the closed form expression for the terms of sequence 5, 
where 5(A:) - 6 5(yt - 1) = for ^ > 1 and 5(0) = 7. 

(b) What is the generating function, G(5; z), for sequence 5? Write out the first four terms of your answer and then write out the closed form 

expression for the generating function. 

15. Suppose that 5 is a sequence with generating function G (5; z) and that T has generating function G(5; cz). How is T related to 5? 
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GRAPH THEORY 



Bipartite 

Draw some lines joining dots in set A 

To some dots in set B. Then we say 

It' s bipartite if we 

Have no "B" joined to "B" 

And no "A" joined to "A" . That okay? 

- a limerick by Chris Hewlett from the The Omnificent English Dictionary In Limerick Form 

GOALS 

This chapter has three principal goals. First, we will identify the basic components of a graph and some of the optional features that many 
graphs have. Second, we will discuss some of the questions that are most commonly asked of graphs. Third, we want to make the reader aware 
of how graphs are used to model different situations. In Section 9.1 , we will discuss these topics in general, and in later sections we will take a 
closer look at selected topics in the theory of graphs. 

Chapter 10 will continue our discussion with an examination of trees, a special type of graph. 
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I 9.1 Graphs— A General Introduction 

Recall that we introduced directed graphs in Chapter 6. 

Definition: Directed Graph. A directed graph consists of a set of vertices, V, and a set of edges, E, connecting certain elements of V. 
Each element of E is an ordered pair from V (i.e., an element ofVxV). The first entry is the initial vertex of the edge and the second entry is 
the terminal vertex. In certain cases there will be more than one edge between two vertices, in which cases the different edges are identif ied 
with labels. 

Despite the set terminology in this definition, we usually think of a graph as a picture, an aid in visualizing a situation. In Chapter 6, we 
introduced this concept to help understand relations on sets. Although those relations were principally of a mathematical nature, it remains true 
that when we see a graph, it tells us how the elements of a set are related to one another. 

Definition: Simple Graph and M altigraph. A simple graph is one for which there is no more than one vertex directed from any vertex to 
another vertex. All other graphs, ones with at least two edges from one vertex to some other vertex, are called multigraphs. 

To illustrate the points that we will make in this chapter, we will introduce the following examples of graphs. 

Example 9.1.1. A Directed Graph. Figure 9.1.1 is an example of a simple directed graph. In set terms, this graph is (V, E), where 
V = {s, a, b] and E = {(s, a), {s, b), (a, b), {b, a), (b, b)]. Note how each edge is labeled either or 1. There are often reasons for 
labeling even simple graphs. Some labels are to help make a graph easier to discuss; others are more significant. We will discuss the signifi- 
cance of the labels on this graph later. 




Figure 9.1.1 

A directed graph 

Example 9.1.2. An Undirected Graph. A network of computers can be described easily using a graph. Figure 9.1.2 describes a 
network of five computers, a, b, c, d, and e. An edge between any two vertices indicates that direct two-way communication is possible 
between the two computers. Note that the edges of this graph are not directed. This is due to the fact that the relation that is being displayed is 
symmetric (i.e., if X can communicate with Y, then 7 can communicate with X). Although directed edges could be used here, it would simply 
clutter the graph. 

a c 



Figure 9.1.2 

An undirected grapti 

There are several other situations for which this graph can serve as a model. One of them is to interpret the vertices as cities and the edges as 
roads, an abstraction of a map such as the one in Figure 9.1.3. Another interpretation is as an abstraction of the floor plan of a house (Figure 
9.1 .4). Vertex a represents the outside of the house; all others represent rooms. Two vertices are connected if there is a door between them. 
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Figure 9.1 .3 

Road Map 



Definition: Undirected Graph. An undirected graph consists of a set V, called a vertex set, and a set E of two-element subsets of V, 
called the edge set. The two-element subsets are drawn as lines connecting the vertices. 

The undirected graph of Figure 9.1.2 is V = [a, b, c, d, ejandE = {{a, b,}, [a, d], [b, c), [b, d], [c, e], [b, e}}. 

Definition: Complete Undirected Graph. A complete undirected graph of n vertices is an undirected graph with the property that each 
pair of distinct vertices are connected to one another. Such a graph is usually denoted by K„. 

Example 9.1.3. A Multigraph. A common occurrence of a multigraph is a road map. The cities and towns on the map can be thought of 
as vertices, while the roads are the edges. It is not uncommon to have more than one road connecting two cities. In order to give clear travel 
directions, we name or number roads so that there is no ambiguity. We use the same method to describe the edges of the multigraph in Figure 
9.1 .5. There is no question what e3 is; however, referring to the edge (2, 3) would be ambiguous. 



■G 




Figure 9.1.5 

A multigraph 

Example 9.1.4. A flowchart is a common example of a simple graph that requires labels for its vertices and some of its edges. Figure 
9.1 .6 is one such example that illustrates how many problems are solved. 
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Figure 9.1.5 

Flowchart for the problem-solving process 

At the start of the problem-solving process, we are at the vertex labeled "Start" and at the end (if we are lucky enough to have solved the 
problem) we will be at the vertex labeled "End." The sequence of vertices that we pass through as we move from "Start" to "End" is called a 
path. The "Start" vertex is called the initial vertex of the path, while the "End" is called the final, or terminal, vertex. Suppose that the problem 
is solved after two attempts; then the path that was taken is Start, R, A, Q, L, A, Q, End. An alternate path description would be to list the 
edges that were used: 1, 2, 3, No, 4, 3, Yes. This second method of describing a path has the advantage of being applicable for multigraphs. 
On the graph in Figure 9 . 1 .5 , the vertex list 1 , 2 , 3 , 4 , 3 does not clearly describe a path between 1 and 3 , but e i , 64 , ee , e? is unambiguous . 

A SUMMARY OF PATH NOTATION AND TERMINOLOGY 

If X and y are two vertices of a graph, then a path between x and y describes a motion from x and y along edges of the graph. Vertex x is called 
the initial vertex of the path and y is called the terminal vertex. A path between x and y can always be described by its edge list, the list of edges 
that were used: (ei, e^, e„), where : (1) the initial vertex of e\ is x\ (2) the terminal vertex of e; is the initial vertex of , 
/= 1, 2, n - 1; and (3) the terminal vertex of e„ is >>. The number of edges in the edge list is the path length. A path on a simple graph 
can also be described by a vertex list. A path of length n will have a list of « + 1 vertices vo = x, vi, V2, ...,v„=j, where, for 
k = 0, 1,2, . . . , « - 1 , (vj , Vj-_|_i) is an edge on the graph. A circuit is a path that terminates at its initial vertex. 

Suppose that a path between two vertices has an edge list (e„ e 2 , . . . ,e„). A siihpath of this graph is any portion of the path described by one 
or more consecutive edges in the edge list. For example, (3, No, 4) is a subpath of (1, 2, 3, No, 4, 3, Yes). Any path is its own subpath; 
however, we call it an improper subpath of itself. All other subpaths are called proper subpaths. 

A path or circuit is simple if it contains no proper subpath that is a circuit. This is the same as saying that a path or circuit is simple if it does not 
visit any vertex more than once except for the common initial and terminal vertex in the circuit. In the problem-solving method described in 
Figure 9.1 .6, the path that you take is simple only if you reach a solution on the first try. 

Example 9.1.5. The leadership structure of a corporation is often represented with a graph as in Figure 9.1 .7. 
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Figure 9.1.7 

Organization of a corporation 

The principle behind such a structure is that everyone but the president has a single immediate supervisor. Any action that anyone takes can 
reach the president only through a unique "chain of command." This chain-of-command property is characteristic of a special type of graph 
called a tree. Note that the edges of this graph are not directed, but, as in a Hasse diagram, the relation between two connecting vertices is clear: 
the top vertex is the supervisor of the lower vertex. 

The process of structured (or top-down) problem solving results in a graph that is similar to this tree. Starting with the top of the tree, which 
would represent the whole problem, the problem is divided into a sequence of separate subproblems. Each subproblem is divided further into 
smaller sub-problems in the same way until the solutions of the lowest problems are easy enough to recognize. 

From these examples, we can see that although a graph can be defined, in short, as a collection of vertices and edges, an integral part of most 
graphs is the labeling of the vertices and edges that allows us to interpret the graph as a model for some situation. 

Example 9.1.6. A Graph as a Model for a Set of Strings. Suppose that you would like to mechanically describe the set of strings of O's 
and I's having no consecutive I's. One way to visualize a string of this kind is with the graph in Figure 9.1.1. Consider any path starting at 
vertex s. If the label on each graph is considered to be the output to a printer, then the output will have no consecutive I's. For example, the 
path that is described by the vertex list {s, a, h, b, a, b, b, a, b) would result in an output of 10010010. Conversely, any string with no 
consecutive I's determines a path starting at s. 

Example 9.1.7. A Tournament Graph. Suppose that four teams compete in a round-robin sporting event; that is, each team meets every 
other team once, and each game is played until a winner is determined. If the teams are named A, B, C, and D, we can define the relation fj on 
the set of teams by X fiY \iX beat Y. For one set of results, the graph of fi might look like Figure 9.1 .8. 
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Figure 9.1.8 

Round-robin graph with four vertices 

Definition: Tournament Graph. 

(a) A tournament graph is a directed graph with the property that no edge connects a vertex to itself, and between any two vertices there is at 
most one edge. 

(b) A complete (or round-robin) tournament graph is a tournament graph with the property that between any two distinct vertices there is 
exactly one edge. 

(c) A single-elimination tournament graph is a tournament graph with the properties that: (i) one vertex (the champion) has no edge 
terminating at it and at least one edge initiating from it; ( ii) every other vertex is the terminal vertex of exactly one edge; and ( Hi) there is a 
path from the champion vertex to every other vertex. 

Example 9.1.8. The major league baseball championship is decided with a single-elimination tournament, where each "game" is actually 
a series of games. Until 1995, the two divisional champions in the American League (East and West) compete in a series of games. The loser 
is eliminated and the winner competes against the winner of the National League series (which is decided as in the American League). The 
toumament graph of the 1983 championship is in Figure 9.1.9. 



Baltimore Ctiicago 

ALEast ^ ALWest 



Philadelphia Los Angeles 

NL East ^ NL West 



Figure 9.1.9 

1983 Major League Baseball Championship 

The question "Once you have a graph, what do you do with it?" might come to mind. The following list of common questions and comments 
about graphs is a partial list that will give you an overview of the remainder of the chapter. 
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Question 1. How can a graph be represented as a data structure for use on a computer? We will discuss some common Pascal data structures 
that are used to represent graphs in Section 9.2. 

Question 2. Given two vertices in a graph, does there exist a path between them? The existence of a path between any or all pairs of vertices in 
a graph will be discussed in Section 9.3. A related question is: How many paths of a certain type or length are there between two vertices? 

Question 3. Is there a path (or circuit) that passes through every vertex (or uses every edge) exactly once? Paths of this kind are called traver- 
sal. We will discuss traversals in Section 9.4. 

Question 4, Suppose that a cost is associated with the use of each vertex and/or edge in a path. What is the "cheapest" path, circuit, or traversal 
of a given kind? Problems of this kind will be discussed in Section 9.5. 

Question 5. Given the specifications of a graph, or the graph itself, what is the best way to draw the graph? The desire for neatness makes this a 
reasonable question. Another goal might be to avoid having edges of the graph cross one another. This is discussed in Section 9.6. 

ISOMORPHIC GRAPHS 

We will close this section by establishing the relation "is isomorphic to," a form of equality on graphs. The graphs in Figure 9.1.10 obviously 
share some similarities, such as the number of vertices and the number of edges. It happens that they are even more similar than just that. If the 
letters a, b, c, and d in a are replaced with the numbers 1,3,4, and 2, respectively, and they are moved around so that they appear as in b, you 
obtain b. 




Figure 9.1.10 

Two Isomorphic Graphs 

Here is a more precise definition that reflects the fact that the actual positioning of vertices isn't an essential part of a graph. 

Definition: Isomorphic Graphs. Two graphs {V, E) and (V, E') are isomorphic if there exists a bijection f:V->V' such that 
[vj, v,j e E if and only if[f(vi), f[vj)) e E'. For multigraphs, we add that the number of edges connecting v; to vj, must equal the number of 
edges from fivi) to /(v,). 

Degrees and Graphic Sequences 

The most significant local characteristic of a vertix within a graph is its degree. Collectively, the degrees can partially characterize a graph. 
Definition: Degree. 

(a) Let V be a vertex of an undirected graph. The degree of v, denoted degfvj, is the number of edges that connect v to the other vertices in 
the graph. 

(b) If V is a vertex of a directed graph, then the outdegree of v, denoted outdegfv), is the number of edges of the graph that initiate at v. The 
indegree ofv, denoted indegfvj. is the number of edges that terminate at v. 

Example 9.4.9. 

(a) The degrees of vertices 1 through 5 in Figure 9.1.11 are 2, 3, 4,1, and 2, respectively. 
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Figure 9. 1.11 

(b) In a tournament graph, outdeg (v) is the number of wins for v and indeg (v) is the number of losses. In a complete (round-robin) tour- 
nament graph with « vertices, outdeg (v) -I- indeg (v) = n - 1 for each vertex. 

Graphic Sequences 

A finite nonincreasing sequence of integers di, d2, d„ is a graphic if there exists a simple graph with n vertices having the sequence as its 
degree sequence. For example, 4,2, 1, 1, 1, lis graphic because the degrees of the following graph match these numbers. 



3»- 




Note: There is no connection between the vertex name/number and its degree. 

EXERCISES FOR SECTION 9.1 
A Exercises 

1. What is the significance of the fact that there is a path connecting vertex b with every other vertex in Figure 9.1.2, as it applies to various 
situations that it models? 

2. Draw a graph similar to Figure 9.1.1 that represents the set of strings of O's and I's containing no more than two consecutive I's. 

3. Draw a directed graph that models the set of strings of O's and I's where all of the I's must appear consecutively. 

4. In the NCAA final-four basketball tournament, the East champion plays the West champion, and the champions from the Mideast and 
Midwest play. The winners of the two games play for the national championship. Draw the eight different single-elimination tournament graphs 
that could occur. 

5 . What is the maximum number of edges in a simple undirected graph with eight vertices? 

6. Which of the graphs in Figure 9.1.11 are isomorphic? What is the correspondence between their vertices? 
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Figure 9.2.11 



7. (a) How many edges does a complete tournament graph with n vertices have? 

(b) How many edges does a single-elimination tournament graph with n vertices have? 

8. Draw complete undirected graphs with 1, 2, 3, 4, and 5 vertices. How many edges does a A^„, a complete undirected graph with n vertices, 
have? 

9. Determine whether the following sequences are graphic. Explain your logic. 

(a) 6,5,4,3,2,1,0 

(b) 2,2, 2, 2, 2, 2, 2 

(c) 3, 2, 2, 2, 2, 2 
(c)5, 3, 3, 3, 3, 3 

(e) 1, 1, 1, 1, 1, 1 

(f) 5,5,4,3,2, 1 

10. (a) Based on observations you might have made in exercise 9, describe as many characteristics as you can about graphic sequences of 
length n. 

(b) Consider the two graphs 




Notice that they have the same degree sequences, 2, 2, 2, 2, 2, 2. Explain why the two graphs are not isomorphic. 
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I 9.2 Data Structures and Computer Generation of Graphs 

In this section, we will describe data structures that are commonly used to represent graphs. In addition we will introduce the basic syntax for 
graphs in Mathematica and Sage. 

Assume that we have a graph with n vertices that can be indexed by the integers \ ,2, ...,n. 

Data Structure 1: Adjacency Matrix. As we saw in Chapter 6, the Information about edges in a graph can be summarized with an adjacency 
matrix, G, where Gy = 1 if and only if vertex / is connected to vertex j in the graph. Note that this is the same as the adjacency matrix for a 
relation, with the exception. 

Data Structure 2: Edge List I. Note that the initializing procedure for an adjacency matrix presumes that a list of edges for the graph exists. 
This second data structure maintains this list form. For each vertex in our graph, there will be a list of edges that initiate at that vertex. If G 
represents the graph's edge information, then G/ would be the list of edges initiating at vertex i. 

Data Structure 3: Edge List II. An even simpler way to represent the edges is to maintain a list of ordered pairs. 




Figure 9.2.1 



Example 9.2.1. Given the graph in Figure 9.2.1 , the adjacency matrix that represents the graph would be 



rO 


1 





1 ^ 








1 


1 








1 





^ 1 








J 



The same graph would be represented with the edge list of type 1 : 
G, =112,41,13,4), (3), HI) 

Finally, the list 

G, =||1,2),|1,4), (2,3),(2,4), |3, 3), (4, 1)) 
describes the same graph with an edge list of type 2. 

A natural question to ask is: Which data structure should be used in a given situation? For small graphs, it really doesn't make much difference, 
but since answers the question "Is there an edge from vertex / to vertex j is easiest with an adjacency matrix, the adjacency matrix would be the 
natural choice. For larger matrices the edge count would be a consideration. If n is large and the number of edges is relatively small, it might 
use less memory to maintain a list of edges instead of building an n x n matrix. Most software for working with graphs will make the decision 
for you. 

:^|^ Mathematica Note 

First, a short history of graphs in Mathematica. Until Mathematica 6.0, all graph related functions were part of a package called Combinator- 
ica. A few graph related functions, notably GraphPlot, were introduced in version 6.0. The output from GraphPlot was (and still is) a 
Graphics object. Starting in Mathematica 8.0, graphs became native Mathematica objects. Graph expressions are backwards compatible in that 
Comhinatorica is still available. In addition, GraphPlot is still a good way to fine-tune the way a graph appears. 

The graph in Example 9.2.1 was drawn by evaluating the following expression. 
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Gi = GraphPlot[{l -»2, 2^3, 2^4, 4-»l, 1^4, 3-»3}, 
VertexLabeling -> True, DirectedEdges -» True] 




The same graph, with a different embedding and style of displaying vertices can be produced using Graph. 



Gi = Graph [{1, 2, 3, 4}, {1^2, 2-»3, 2-»4, 4^1, 1^4, 3-»3}, 
VertexLabels "Name", ImagePadding -» 5] 

1 





From both expressions one sees that an edge list in the form of Mathematica rules is used to specify the edges. Using GraphPlot, the 
vertices are implicit from the numbers that appear among the edges. This has the drawback of not allowing for an isolated vertex without any 
edges connected to it. A second acceptable way to specify edges is using GraphPlot is with an adjacency matrix: 



G, = 



GraphPlot [G2 , VertexLabeling -» True, DirectedEdges -» True] 




Notice that in these examples, the placement of the vertices is determined by Mathematica. Both functions have options that do allow for 
placement. 

An example of a system graph function is GraphDistance. For example, we can ask for the distance from vertex 1 to vertex 3 in Gi and 
find that one needs to travel along two edges to go from 1 to 3. 

GraphDistance [Gi, 1, 3] 

2 

In Gi , you can't get from vertex 3 to vertex 4, so the distance is infinite: 
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GraphDistance [Gi , 3, 4] 

oo 

We can get the adjacency matrix of Gi or the edge list of G2 with package function 
Ad j acencyMatr ix [Gi ] 

SparseArray[<6>, |4, 4)] 

In practice, graphs are often quite large, so Mathematica automatically uses a sparse array data structure to store the adjacency matrix. To see 
the actual matrix you can use the Normal function: 

Normal [Adj acencyMatrix [Gi] ] 

/O 1 1 \ 
11 
10 

,1 0, 

Larger graphs can be generated by creating lists of edges using rules. For example, here is graph of all divisors of 6 ! = 720 with an edge 

j 72 
connecting j to ^ if - is a prime. For example, there is an edge connecting 72 to 24 because — = 3 is prime. The code for creating G3 may 

k 24 

look daunting, but essentially the set of ordered pairs in the set 

Divisors[6!] = (1, 2, 3, 4, 5, 6, 8, 9, 10, 12, 15, 16, 18,20,24, 30,36,40,45,48,60,72,80,90, 120, 144, 180,240,360,720) 

and the relatively few ordered pairs where the first number divided by the second number is prime are selected out. These ordered pairs are 
then converted to a rule: [a, b] is converted to a ^ b. 

edges = Map [Rule @@ n & , Select [Tuples [Divisors [6 !] , 2], PrimeQ [Divide @@ n] &] ] ; 
G3 = Graph [edges, VertexLabels -» "Name", ImagePadding ^ 5] 




The adjacency matrix of G3 can certainly be considered sparse since it has relatively few I's. 
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Adj acencyMatrix [G3 ] //Normal 
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^1 Sage Note 
1^ 



Sage graphs are specified using version 1 of edge lists. Here is how the graph in Example 9.2.1 is generated and then displayed. 



sage: Gl = DiGraph( { 1 : [4, 2], 2 : [3, 4], 3 : [3], 4 : [1]}) 
Gl.show( ) 




There are many special graphs and graph families that are available in Sage through the graphs module. They are referenced with the prefix 
graphs . followed by the name and zero or more paramenters inside parentheses. Here are a couple of them, first a complete graph with five 
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sage : graphs .CompleteGraph ( 5 ) . show( ) 




Here is a wheel graph, named for an obvious pattern of vertices and edges. We assign a name to it first and then show the graph without 
labeling the vertices. 



sage: w=graphs .WheelGraph( 2 ) 
sage : w. show( vertex_labels=f alse ) 




There are dozens of graph methods, one of which determines the degree sequence of a graph, 
sage: w. degree_sequence ( ) 

[19^ ^ f ^ f 
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The degree sequence method is defined within the graphs module, but the prefix graphs . isn't needed because the value of w inherits the 
graphs methods. 

Example 9J,J,. Consider an directed graph represented by the Division I NCAA college basketball teams in the United States for a 
given year. There are approximately 350 teams in Division 1 . Suppose we constructed the graph with an edge from team A to team B if A 
beat B at least once in the season; and we label the edge with the number of wins. Since the average team plays around 30 games in a 

30 3 'SO 

season, most of which will be against other Division 1 teams, we could expect around — = 5250 edges in the graph (this is somewhat 
reduced by games with lower division teams and cases where two or more wins over the same team produces one edge). Since 5250 is much 
smaller than 350^ = 122500 entries in an adjacency matrix, we would consider this a sparse graph and would expect to find one of the edge 
list structures most efficient. Even if an adjacency matrix is created, as could be done in Mathematica, it would be represented using a sparse 
matrix data structure. 

EXERCISES FOR SECTION 9.2 
A Exercises 

1. Estimate the number of vertices and edges in each of the following graphs. Would the graph be considered sparse? 

(a) Vertices: Cities of the world that are served by at least one airline. 
Edges: Pairs of cities that are cormected by a regular direct flight. 

(b) Vertices: ASCII characters. 

Edges: connect characters that differ in their binary code by exactly two bits. 

(c) Vertices: All English words. 

Edges: An edge connects word x to word y if x is a prefix of y . 

2. Each edge of a graph is colored with one of the four colors red, blue, yellow, or green. How could you represent the edges in this graph 
using a variation of the adjacency matrix structure? 

3. Directed graphs Gi, , each with vertex set |1, 2, 3, 4, 5) are represented by the matrices below. Which graphs are isomorphic? 
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4. The following Sage command verifies that the wheel graph with four vertices is isomorphic to the complete graph with four vertices. 



graphs .WheelGraph( 4 ) . is_isomorphic ( graphs . CompleteGraph ( 4 ) ) 



Evaluate the expression dir ( graphs . ) in Sage to get a list of graph names, and then find two more pairs of isomorphic graphs. 
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I 9.3 Connectivity 

This section is devoted to a question that, when posed in relation to the graphs that we have examined, seems trivial. That question is: Given 
two vertices, i and t, of a graph, is there a path from s to t? If s = t, this question is interpreted as asking whether there is a circuit of positive 
length starting at j. Of course, for the graphs we have seen up to now, this question can be answered after a brief examination. 

There are two situations under which a question of this kind is nontrivial. One is where the graph is very large and an "examination" of the 
graph could take a considerable amount of time. Anyone who has tried to solve a maze may have run into a similar problem. The second 
interesting situation is when we want to pose the question to a machine. If only the information on the edges between the vertices is part of the 
data structure for the graph, how can you put that information together to determine whether two vertices can be connected by a path? 

Connectivity Terminology. Let v and w be vertices of a directed graph. Vertex v is connected to vertex w if there is a path from v to w. Two 

vertices are strongly connected if they are connected in both directions to one another. A graph is connected if, for each pair of distinct vertices, 
V and w, v is connected to w or w is connected to v. A graph is strongly connected if every pair of its vertices is strongly connected. For an 
undirected graph, in which edges can be used in either direction, the notions of strongly connected and connected are the same. 

Theorem 93.1. If a graph has n vertices and vertex u is connected to vertex w, then there exists a path from u to w of length no more 
than n. 

Proof (Indirect): Suppose u is connected to w, but the shortest path from u to w has length m, where m> n. A vertex list for a path of 
length m wiU have m + 1 vertices. This path can be represented as (vq, vj , . . . , v„), where vq = m and v„ = w. Note that since there are only 
n vertices in the graph and m vertices are listed in the path after vo, we can apply the pigeonhole principle and be assured that there must be 
some duplication in the last m vertices of the vertex Ust, which represents a circuit in the path. This means that our path of minimum length 
can be reduced, which is a contradiction. ■ 

Methods for Testing Connectivity 

Questions: 

Question 1 : Given a graph and two vertices in the graph, is there a path from the first vertex to the second? 
Question 2: If the answer to Question 1 is "yes" then what is the path? 
Some Answers 

Method 1: Adjacency Matrix Method. Suppose that the information about edges in a graph is stored in an adjacency matrix, G. The relation, 
r, that G defines is v r w if there is an edge connecting v to w. Recall that the composition of r with itself, r^ , is defined by v w if there exists a 
vertex y such that vry and yrw; that is, v is connected to w by a path of length 2. We could prove by induction that the relation , ^ > 1 , is 

defined by vr* w if and only if there is a path of length k from v to w. Since the transitive closure, r+ , is the union of r, , r^ , we can 
answer our connectivity question by determining the transitive closure of r, which can be done most easily by keeping our relation in matrix 
form. Theorem 9.3.1 is significant in our calculations because it tells us that we need only go as far as E" to determine the matrix of the 
transitive closure. 

The main advantage of the adjacency matrix method is that the transitive closure matrix can answer all questions about the existence of paths 
between vertices. If G+is the matrix of the transitive closure, v,-, is connected to vj if (E*)jj = 1 . A directed graph is connected if (E*)ij = 1 or 
(_E'^)ji = 1 for each ; t j. A directed graph is strongly connected if its transitive closure matrix has no zeros. 

A disadvantage of the adjacency matrix method is that the transitive closure matrix tells us whether a path exists, but not what the path is. 
Method 2: Broadcasting. We wiU describe this method first with an example. 

Example 93.1. The football team at Mediocre State University (MSU) has had a bad year, 2 wins and 9 losses. Thirty days after the end of the 
football season, the university trustees is meeting to decide whether to rehire the head coach; things look bad for him. However, on the day of 
the meeting, the coach releases the following list of results fi-om the past year: 

Mediocre State defeated Local A&M. 
Local A&M defeated City College. 
City College defeated Corn State U. 
... (25 results later) 

Tough Tech defeated Enormous State University (ESU). 
And ESU went on to win the national championship! 

The trustees were so impressed that they rehired the coach with a raise in pay! How did the coach come up with such a Ust? 

In reality, such lists exist occasionally and appear in newspapers from time to time. Of course they really don't prove anything since each team 
that defeated MSU in our example above can produce a similar chain of results. Since college football records are readily available, the coach 
could have found this list by trial and error. AH that he needed to start with was that his team won at least one game. Since ESU lost one game, 
there was some hope of producing the chain. 
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The problem of finding this list is equivalent to finding a path in the tournament graph for last year's football season that initiates at MSU and 
ends at ESU. Such a graph is far from complete and would be represented using edge lists. To make the coach's problem interesting, let's 
imagine that only the winner of a game remembers the result of the game. The coach's problem has now taken on the flavor of a maze. To reach 
ESU, he must communicate with the various teams along the path. One way that the coach could have discovered his list in time is by sending 
the following emails to the coaches of the two teams that MSU defeated during the season: 

When this example was first written, we commented that ties shouid be ignored. Most recent NCAA ruies caii for a tiebreai<er in coiiege footbaii and so ties are no 
longer an issue. Email was also not common and we described the process in terms of letter, not email messages. The coach could also have asked the MSU math 
department to use Mathematica or Sage to get the path! 



Dear Football Coach: 

Please follow these directions exactly. 

(1) If you are the coach at ESU, call the coach at MSU now and tell him who sent 
you this message. 

(2) If you are not the coach at ESU and this is the first message of this type that 
you have received, then: 

(a) Remember who you received this message from. 

(b) Forward a copy of this message, signed by you, to each of the coaches 
whose teams you defeated during the past year. 

(3) Ignore this message if you have received one like it already. 



Signed, 



Coach of MSU 

Observations: From the conditions of this message, it should be clear that if everyone cooperates and if coaches participate within a day of 
receiving the message: 

(a) If a path of length n exists from MSU to ESU, then the coach will know about it in n days. 

(b) By making a series of phone calls, the coach can obtain the path that he wants by first calling the coach who defeated ESU (the person who 
sent ESU's coach that message). This coach will know who sent him a letter, and so on. Therefore, the vertex list of the desired path is obtained 

in reverse order. 

(c) If a total of M football games were played, no more than M letters will be sent out. 

(d) If a day passes without any letter being sent out, no path from MSU to ESU exists. 

(e) This method could be extended to obtain a list of all teams that a given team can be connected to. Simply imagine a series of letters like the 

one above sent by each football coach and targeted at every other coach. 

The general problem of finding a path between two vertices in a graph, if one exists, can be solved exactly as we solved the problem above. The 
following algorithm is commonly called a breadth-first search. 

Algorithm 9.3.1. A broadcasting algorithm for finding a path between vertex ; and vertex j of a graph having n vertices. The each item Vj. of a 
list V = lVi, V2, V„), consist of a Boolean field Vi^Sonnd and an integer field Vj.from. The sets Di, D2, called depth sets, have the 
property that if k is in , then the shortest path from vertex ; to vertex k is of length r. In Step 5, a stack is used to put the vertex list for the 
path from the vertex i to vertex j in the proper order. 



1. Set the value Ujc. found equal to False, k = 1, 2, 

2. r = 

3. Do = {i} 

4. While Vj. found) and {Dri=0) 

4.1 Dr^i =0 

4 . 2 For each k in Dr do 

For each edge (k,t) do 
If Vf found == False 
then Vt . found = True 
f^rom = k 

Dr+l = D^+lU it] 

4.3 r = r + 1 

5. If Vj. found = True Then 

5.1 S = Empty Stack 

5.2 k=j 

5.3 While V;c.from ^ i 

5.3.1 Push k onto S 

5.3.2 k = Vk.trom 



Notes on Algorithm 9.3.1: 

(a) This algorithm wMl produce one path from vertex / to vertex j, if one exists, and that path wMl be as short as possible. If more than one path 
of this length exists, then the one that is produced depends on the order in which the edges are examined and the order in which the elements of 
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D,. are examined in Step 4. 

(b) The condition £),. is analogous to the condition that if no mail is sent in a given stage of the process, in which case MSU cannot be 
connected to ESU. 

(c) This algorithm can be easily revised to find paths to all vertices that can be reached from vertex i. Step 5 would be put off until a specific 
path to a vertex is needed since the information in V contains an efficient list of all paths. The algorithm can also be extended further to find 
paths between any two vertices. 

Example 9.3.2. Consider the graph in Figure 9.3.1. The existence of a path from vertex 2 to vertex 3 is not difficult to determine by 
examination. After a few seconds, you should be able to find two paths of length four. Algorithm 9.3.1 will produce one of them. 



3 



6 



Figure 9.3.1 

Suppose that the edges from each vertex are sorted in ascending order by terminal vertex. For example, the edges from vertex 3 would be in 
the order (3, 1), (3, 4), (3, 5). In addition, assume that in the body of Step 4 of the algorithm, the elements of Dr are used in ascending 
order. Then at the end of Step 4, the value of V will be 

k 1 2 3 4 5 6 

Vj. .found J- J- J- J- J- J- 

Vt..from 2 4 6 1 1 4 

Depth set 1 3 4 2 2 3 (value of r for which /t e A ) 

Therefore, the path (2, 1, 4, 6, 3) is produced by the algorithm. Note that if we wanted a path from 2 to 5, the information in V produces the 
path (2, 1, 5) since V^.from = 1 and Vi.from = 2. A shortest circuit that initiates at vertex 2 is also available by noting that V2.from = 4, 
V4.from = 1 , and Vi.from = 2; thus the circuit (2, 1, 4, 2) is obtained. 

Mathematica Note 

Consider the following defined by a adjecency matrix. 
SeedRandoin[2014] ; 

adj = Randomlnteger [{0, 1}, {8, 8}] * Table [Boole [i stj], {i, 1, 8}, {j, 1, 8}] 
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g = Adj acencyGraph [adj , VertexLabels -* "Name", ImagePadding 5] 
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Is 1 is connected to 8 in this graph? The function FindShortestPath will give us a shortest: 
FindShortestPath [g, 1, 8] 

(1, 2,4, 8) 

If we are interested in reaching vertex 8 from all vertices, we can do that just as easily. 
Map[{tt, FindShortestPath [g. All, 8] [B] } &, Range[8]] 



( 1 


|1, 6,4, 


81 ^ 


2 


|2, 4, 8 


1 


3 


|3, 2,4, 


81 


4 


|4,8) 




5 


|5,8) 




6 


|6, 4, 8 


1 


7 


|7, 2,4, 


81 


k8 


181 


) 



Finally, here is a matrix of shortest paths between any two vertices in the graph. 

Map [Apply [FindShortestPath [g. All, All], tt] &, Outer [List, Range[l, 8], Range[l, 8]], {2}] 



HI 


11,21 


11,31 


(1,2,41 


(1, 2, 4, 51 


(1,61 


(1,6,71 


(1, 2,4, 81 \ 


|2, 4, 3, 11 


121 


(2,4, 31 


(2,41 


(2,4,51 


(2,4,61 


(2,4, 3,71 


(2, 4, 81 


|3, H 


13,21 


131 


(3,2,41 


(3, 2, 4, 51 


(3, 1,61 


(3,71 


(3, 2,4, 81 


|4,3, H 


(4, 3, 21 


|4, 31 


(41 


(4,51 


(4,61 


(4,3,71 


(4,81 


|5,7, H 


15,21 


(5, 8, 31 


(5,41 


(51 


(5,61 


(5,71 


(5,81 
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(6, 1,21 


(6,31 


(6,41 


(6,4,51 


(61 


(6,71 


(6, 4, 81 


|7, U 


17,21 


(7,31 


(7,2,41 


(7, 2,4, 51 


(7, 1,61 


(71 


(7, 2,4, 81 


|8,3, U 


(8, 3, 21 


(8,31 


(8,5,41 


(8,51 


(8, 5, 61 


(8, 3, 71 


(81 



If no path exists between two vertices, the result is an empty list, as illustrated by this simple example. 

h = Graph [{1 ^2, 2^3, 3-»4}]; 
FindShortestPath [h, 4, 1] 

(1 
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Sage Note 

Here is some sage code that 

1. Generates a random undireced graph with 18 vertices. For each pair of vertices, an edge is included between them with probability 0.2. 
Since there are I ^ I = 153 potential edges, we expect that there will be approximately 0.2yl53s;31 edges. 

2. Counts the number of edges. In this case the number is a bit less than expected. 

3. Finds a shortest path from vertex to vertex 8. 

4. Generates a list of vertices that would be reached in a breadth-first search. The expression Gr . depth_f irst_search ( ) creates an 
iterator that is convenient for programming. Wrapping list( ) around the expression shows the order in which the vertices are visited. 

5. Generates a list of vertices that would be reached in a depth- first search. In this type of search you travel in one direction away from the 
starting point until no further new vertices. We will discuss this search later. 

sage: Gr=graphs .RandomGNP( 18,0.2); 
Gr.show( ) 




sage: len ( Gr . edges ( labels=False) ) 
25 

sage: Gr . shortest_path ( , 8) 

[0, 10, 14, 8] 

sage : list (Gr .breadth_f irst_search( ) ) 

[0, 17, 10, 11, 4, 13, 14, 3, 15, 7, 

sage : list (Gr . depth_f irst_search ( ) ) 

[0, 11, 15, 12, 5, 6, 7, 14, 10, 9, 3, 4, 17, 13, 1, 16, 8, 2\ 



9, 16, 1, 12, 2, 6, 5] 



EXERCISES FOR SECTION 9.3 
A Exercises 

1. Apply Algorithm 9.3.1 to find a path from 5 to 1 in Figure 9.3.1. What would be the final value of V? Assume that the terminal vertices in 
edge lists and elements of the depth sets are put into ascending order, as we assumed in Example 9.3.1 . 

2. Apply Algorithm 9.3.1 to find a path from the bedroom to outside using the edge list data structure in Example 9.2.1. Assume that the 
elements of the depth sets are put into ascending order. 

3. In a simple undirected graph with no self-loops, what is the maximum number of edges you can have, keeping the graph unconnected? 
What is the minimum number of edges that will assure that the graph is connected? 

4. Use a broadcasting algorithm to determine the shortest path from vertex a to vertex i in the graphs shown in Figure 9.3.2. List the depth sets 
and the stack that is obtained. 
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(b) 
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Figure 9.3.2 

Exercise 4 

B Exercise 

5. Prove (by induction on k) that if the relation r on vertices of a graph is defined by v r w if there is an edge connecting v to h', then ,k > 1 , 
is defined by v r* w if there is a path of length k from v to w. 
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I 9.4 Traversal: Eulerian and Hamiltonian Graphs 



The subject of graph traversals has a long history. In fact, the solution by Leonhard Euler (Switzerland, 1707-83) of the Konigsberg Bridge 
Problem is considered by many to represent the birth of graph theory. 



The Konigsberg Bridge Problem and Euierian Graphs 




Figure 9.4.1 

Map of Konigsberg 

A map of the Prussian city of Konigsberg (circa 1735) in Figure 9.4.1 shows that there were seven bridges connecting the four land masses that 
made up the city. The legend of this problem states that the citizens of Konigsberg searched in vain for a walking tour that passed over each 
bridge exactly once. No one could design such a tour and the search was abruptly abandoned with the publication of Euler's Theorem. 

Theorem 9.4.1: Euler' s Theorem — Konigsberg Case. No walking tour of Konigsberg can be designed so that each bridge is used 
exactly once. 




Figure 9.4.2 

Multigraph representation of Konigsberg 

Proof: The map of Konigsberg can be represented as an undirected multigraph, as in Figure 9.4.2. The four land masses are the vertices 
and each edge represents a bridge. The desired tour is then a path that uses each edge once and only once. Since the path can start and end at 
two different vertices, there are two remaining vertices that must be intermediate vertices in the path. If x is an intermediate vertex, then every 
time that you visit x, you must use two of its incident edges, one to enter and one to exit. Therefore, there must be an even number of edges 
connecting x to the other vertices. Since every vertex in the Konigsberg graph has an odd number of edges, no tour of the type that is desired 
is possible. ■ 

As is typical of most mathematicians, Euler wasn't satisfied with solving only the Konigsberg problem. His original theorem, which is para- 
phrased below, concerned the existence of paths and circuits like those sought in Konigsberg. These paths and circuits have become associated 
with Euler's name. 

Definitions: Eulerian Paths, Circuits, Graphs. A Eulerian path through a graph is a path whose edge list contains each edge of the 
graph exactly once. If the path is a circuit, then it is called a Eulerian circuit. A Eulerian graph is a graph that possesses a Eulerian path. 

Example 9.4.1. Without tracing any paths, we can be sure that the graph below has an Eulerian circuit because all vertices have an even 
degree. This follows from the following theorem. 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-ShareAlikes 3.0 United States License. 



Chapter 9 - Graph Theory 



201 




Theorem 9.4.2: Euler's Theorem — General Case. An undirected graph is Eulerian if and only if it is connected and has either zero or 
two vertices with an odd degree. If no vertex has an odd degree, then the graph has a Eulerian circuit. 

Proof: It can be proven by induction that the number of vertices in an undirected graph that have an odd degree must be even. We will leave the 
proof of this fact to the reader as an exercise. The necessity of having either zero or two vertices of odd degree is clear from the proof of the 
Konigsberg case of this theorem. Therefore, we will concentrate on proving that this condition is sufficient to ensure that a graph is Eulerian. 
Let k be the number of vertices with odd degree. 

Phase 1 .\ik = 0, start at any vertex, vq, and travel along any path, not using any edge twice. Since each vertex has an even degree, this path 
can always be continued past each vertex that you reach except vq. The result is a circuit that includes vq. If k = 2, let vq be either one of the 
vertices of odd degree. Trace any path starting at vq using up edges until you can go no further, as in the ^ = case. This time, the path that 
you obtain must end at the other vertex of odd degree that we will call vi . At the end of Phase 1 , we have an initial path that may or may not 
be Eulerian. If it is not Eulerian, Phase 2 can be repeated until all of the edges have been used. Since the number of unused edges is decreased 
in any use of Phase 2, a Eulerian path must be obtained in a finite number of steps. 

Phase 2. As we enter this phase, we have constructed a path that uses a proper subset of the edges in our graph. We will refer to this path as the 
current path. Let V be the vertices of our graph, E the edges, and E„ the edges that have been used in the current path. Consider the graph 
G' = (V, E — £„). Note that every vertex in G' has an even degree. Select any edge, e, from G'. Let vv, and be the vertices that e connects. 
Trace a new path starting at v„ whose first edge is e. We can be sure that at least one vertex of the new path is also in the current path since 
(V, E) is connected. Starting at v„, there exists a path in (V, E) to any vertex in the current path. At some point along this path, which we can 
consider the start of the new path, we will have intersected the current path. Since the degree of each vertex in G' is even, any path that we start 
at Vn can be continued until it is a circuit. Now, we simply augment the current path with this circuit. As we travel along the current path, the 
first time that we intersect the new path, we travel along it (see Figure 9.4.3). Once we complete the circuit that is the new path, we resume the 
traversal of the current path. 
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Figure 9.4.3 

Augmenting tiie current path in tine proof of Tfieorem 9.4.2 
If the result of this phase is a Eulerian path, then we are finished; otherwise, repeat this phase. # 

Example 9.4.2. The complete undirected graphs ^2 and K2„+i,n = 1, 2, 3, , are Eulerian. If n > 1, then A'2n is not Eulerian. 

HAMILTONIAN GRAPHS 

To search for a path that uses every vertex of a graph exactly once seems to be a natural next problem after you have considered Eulerian 
graphs. The Irish mathematician Sir William Hamilton (1805-65) is given credit for first defining such paths. He is also credited with discover- 
ing the quaternions, for which he was honored by the Irish government with a postage stamp in 2004. 




Definition: Hamiltonian Paths, Circuits, and Graphs. A Hamiltonian path through a graph is a path whose vertex list contains each 
vertex of the graph exactly once, except if the path is a circuit, in which case the initial vertex appears a second time as the terminal vertex. If 
the path is a circuit, then it is called a Hamiltonian circuit. A Hamiltonian graph is a graph that possesses a Hamiltonian path. 

Example 9.4.3. Figure 9.4.4 shows a graph that is Hamiltonian. In fact, it is the graph that Hamilton used as an example to pose the 
question of existence of Hamiltonian paths in 1859. In its original form, the puzzle that was posed to readers was called "Around the World." 
The vertices were labeled with names of major cities of the world and the object was to complete a tour of these cities. The graph is also 
referred to as the dodecahedron graph, where vertices correspond with the corners of a dodecahedron and the edges are the edges of the solid 
that connect the corners. 
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Figure 9.4.4 

The dodecahedron graph, a Hamiltonian graph. 




Figure 9.4.5 

The regular dodecahedron 

Unfortunately, a simple condition doesn't exist that characterizes a Hamiltonian graph. An obvious necessary condition is that the graph be 
connected; however, there is a connected undirected graph with four vertices that is not Hamiltonian. Can you draw such a graph? 

A Note on What Is Possible and What Is Impossible. The search for a Hamiltonian path in a graph is typical of many simple-sounding 
problems in graph theory that have proven to be very difficult to solve. Although there are simple algorithms for conducting the search, they are 
impractical for large problems because they take such a long time to complete as graph size increases. Currently, every algorithm to search for a 
Hamiltonian path in a graph takes exponential time to complete. That is, if T (n) is the time it takes to search a graph of n vertices, then there is 
a positive real number a, a > 1, such that T (n) > a" for all but possibly a finite number of positive values for n. No matter how close to 1 we 
can make a, a" will grow at such a fast rate that the algorithm will not be feasible for large values of n. For a given algorithm, the value of a 
depends on the relative times that are assigned to the steps, but in the search for Hamiltonian paths, the actual execution time for known 
algorithms is large with 20 vertices. For 1,000 vertices, no algorithm is likely to be practical, and for 10,000 vertices, no currently known 
algorithm could be executed. 

It is an unproven but widely held belief that no faster algorithm exists to search for Hamiltonian paths. A faster algorithm would have to be one 

that takes only polynomial time; that is, T(n) < p(n), for some polynomial sequence p. To sum up, the problem of determining whether a 

graph is Hamiltonian is theoretically possible; however, for large graphs we consider it a practical impossibility. Many of the problems we will 

discuss in the next section, particularly the Traveling Salesman Problem, are thought to be impossible in the same sense. 



Definition: The n-cube. Let n > 1, and let B" be the set of strings ofO's and I's with length n. The n-cube is the undirected graph with 
a vertex for each string in B" and an edge connecting each pair of strings that differ in exactly one position. 

The 1-cube, 2-cube, 3-cube, and 4-cube are shown in Figure 9.4.6. 
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Figure 9.4.6 

n-cubes, n = 1, 2, 3, 4 



The Gray Code. A Hamiltonian circuit of the n-cube can be described recursively. The circuit itself, called the Gray Code, is not the only 
Hamiltonian circuit of the n-cube, but it is the easiest to describe. The standard way to write the Gray Code is as a column of strings, where the 
last string is followed by the first string to complete the circuit. 

'0 



Basis (n = 1): The Gray Code for the 1-cube is Gi = 



Note that the edge between and 1 is used twice in this circuit. That doesn't 



violate any rules for Hamiltonian circuits, but can only happen if a graph as two vertices. 

Recursion: Given the Gray Code for the «-cube, n > I, then G„+i is obtained by (1) listing G„ with each string prefixed with 0, and then (2) 
reversing the list of strings in G„ with each string prefixed with 1 . Symbolically, the recursion can be expressed as 

0G„ 



G, 



C 0G„ 
- U G/ 



where G„' is the reverse of list G„. The Gray Codes for the 2-cube and 3-cube are 



(00 

01 

11 
I 10 j 



and Gi 



(000 
001 
Oil 
010 
110 
111 
101 

I 100 



Applications of the Gray Code. One application of the Gray code was discussed in the Introduction to this book. An other application is in 
statistics. In a statistical analysis, there is often a variable that depends on several factors, but exactly which factors are significant may not be 
obvious. For each subset of factors, there would be certain quantities to be calculated. One such quantity is the multiple correlation coefficient 
for a subset. If the correlation coefficient for a given subset. A, is known, then the value for any subset that is obtained by either deleting or 
adding an element to A can be obtained quickly. To calculate the correlation coefficient for each set, we simply travel along G„, where n is the 
number of factors being studied. The first vertex will always be the string of O's, which represents the empty set. For each vertex that you visit, 

the set that it corresponds to contains the factor if the character is a 1 . 
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EXERCISES FOR SECTION 9.4 
A Exercises 

1 . Locate a map of New York City and draw a graph that represents its land masses, bridges and tunnels. Is there a Eulerian path through New 
York City? You can do the same with any other city that has at least two land masses. 

2. Which of the drawings in Figure 9.4.7 can be drawn without removing your pencil from the paper and without drawing any line twice? 




Figure 9.4.7 

Exercise 2 

3. Write out the Gray Code for the 4-cube. 

4. Find a Hamiltonian circuit for the dodecahedron graph in Figure 9.4.4. 

5 . The Euler Construction Company has been contracted to construct an extra bridge in Konigsberg so that a Eulerian path through the town 
exists. Can this be done, and if so, where should the bridge be built? 

6. (a) Determine which of the graphs in Figure 9.4.8 has a Eulerian path? 

(b) Find a Eulerian path for the graphs that have one. 
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(b) 




Figure 9.4.8 

Exercise 6 

B Exercises 

7. Formulate Euler's theorem for directed graphs. 

8. Prove that the number of vertices in an undirected graph with odd degree must be even. (Hint: Prove by induction on the number of edges.) 

9. (a) Under what conditions will a round-robin tournament graph be Eulerian? 
(b) Prove that every round-robin tournament graph is Hamiltonian. 

10. For what values of n is n-cube Eulerian. 
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I 9.5 Graph Optimization 

The common thread that connects all of the problems in this section is the desire to optimize (maximize or minimize) a quantity that is associ- 
ated with a graph. We will concentrate most of our attention on two of these problems, the Traveling Salesman Problem and the Maximum 
Flow Problem. At the close of this section, we will discuss some other common optimization problems. 

Definition: Weighted Graph. A weighted graph, (V, E, w), is a graph (V, E) together with a weight function w : E -> R. If e e E, 
w (e) is the weight on edge e. 

As you will see in our examples, w (e) is usually a cost associated with the edge e; therefore, most weights will be positive. 

Example 9.5.1. Let V be the set of six capital cities in New England: Boston, Augusta, Hartford, Providence, Concord, and Montpelier. Let E 
be {{a, b] e V xV \ a + b); that is, (V, E) is a complete unordered graph. An example of a weight function on this graph is 

H'(ci, ci) = the distance from C] to C2 . 
Many road maps define distance functions as in Figure 9.5 .1 . 





ME 


MA 


NH 


CT 


VT 


RI 


Augusta, ME 




165 


148 


266 


190 


208 


Boston, MA 


165 




75 


103 


192 


43 


Concord, NH 


148 


75 




142 


117 


109 


Hartford, CT 


266 


103 


142 




204 


70 


Montpelier, VT 


190 


192 


117 


204 




223 


Providence, RI 


208 


43 


109 


70 


223 





FIGURE 9.5.1 

Distances between capital cities of New England 



The Traveling Salesman Problem 

The Traveling Salesman Problem is, given a weighted graph, to find a circuit {ei, ej, e„) that visits every vertex at least once and mini- 
mizes the sum of the weights, 

)i 

E Med 

i=l 

Any such circuit is called an optimal path. 

Notes, (a) Some statements of the Traveling Salesman Problem require that the circuit be Hamiltonian. In many applications, the graph 
in question will be complete and this restriction presents no problem. 

(b) If the weight on each edge is constant, for example, w(e) = 1 , then the solution to the Traveling Salesman Problem will be any Hamilto- 
nian circuit, if one exists. 

Example 9.5.2. The Traveling Salesman Problem gets its name from the situation of a salesman who wants to minimize the number of 
miles that he travels in visiting his customers. For example, if a salesman from Boston must visit the other capital cities of New England, then 
the problem is to find a circuit in the weighted graph of Example 9.5.1. Note that distance and cost are clearly related in this case. In 
addition, tolls and traffic congestion might also be taken into account in this case. 

The search for an efficient algorithm that solves the Traveling Salesman has occupied researchers for years. If the graph in question is 
complete, there are (n - 1) ! different circuits. As n gets large, it is impossible to check every possible circuit. The most efficient algorithms for 
solving the Traveling Salesman Problem take an amount of time that is proportional to n2" . Since this quantity grows so quickly, we can't 
expect to have the time to solve the Traveling Salesman Problem for large values of n. Most of the useful algorithms that have been developed 
have to be heuristic; that is, they find a circuit that should be close to the optimal one. One such algorithm is the "closest neighbor" algorithm, 
one of the earliest attempts at solving the Traveling Salesman Problem. The general idea behind this algorithm is, starting at any vertex, to visit 
the closest neighbor to the starting point. At each vertex, the next vertex that is visited is the closest one that has not been reached. This 
shortsighted approach typifies heuristic algorithms called greedy algorithms, which attempt to solve a minimization (maximization) problem by 
minimizing (maximizing) the quantity associated with only the first step. 



Algorithm 9.5.1. The Closest Neighbor Algorithm. Let G = {V, E, w) be a complete weighted graph with | V | = n. The closest 
neighbor circuit through G starting at V I is(vj, Vj, ...,v„), defined by the steps: 

1. V, = V-(v,). 

2 . For k = 2 ton - I 

2.1 Vj, = the closest vertex in Vj,_; tovi,_] 

(* Mvt-i , v^) = min(w(v,,_, ,v) \ v e V^_;) *) 

In case of a tie for closest, v^. may be chosen arbitrarily. 

2.2 V,= V,_, - [V, ] 

3. v„ = the only element of Vn. 



The cost of the closest neighbor circuit is 
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Z MVk, Vt+i) + w(v„, Vi) 

Example 9.5.3. The closest neighbor circuit starting at A in Figure 9.5.2 is (1, 3, 2, 4, 1), with a cost of 29. The optimal path is 
(1, 2, 3, 4, 1), with a cost of 27. 




FIGURE 9.5.2 

Example 9.5.3 

Although the closest neighbor circuit is often not optimal, we may be satisfied if it is close to optimal. If Copt and Ccn are the costs of optimal 

c 

and closest neighbor circuits in a graph, then it is always the case that Copi < Ccn or ^ > 1. We can assess how good the closest neighbor 
algorithm is by determining how small the quantity ^ gets. If it is always near 1, then the algorithm is good. However, if there are graphs for 

C '^9 

which It IS large, then the algorithm may be discarded. Note that in Example 9.5.3, = — x 1.074. A 7% increase in cost may or may not 

Copt 27 

be considered significant, depending on the situation. 

Example 9.5.4. A salesman must make stops at vertices A, B, and C, which are all on the same one-way street. The graph in Figure 9.5.3 
is weighted by the function 

w(i, j) = the time it takes to drive from vertex ! to vertex 7. 

Note that if j is down the one-way street from /, then w (i, j) < w (j, i). The values of Copt , and Ccn are 20 and 32, respectively. Verify that 

c 

Ccn is 32 by using the closest neighbor algorithm. The value of = 1 .6 is significant in this case since our salesman would spend 60% 
more time on the road if he used the closest neighbor algorithm. 




FIGURE 9.5.3 

Example 9.5.4 

A more general result relating to the closest neighbor algorithm presumes that the graph in question is complete and that the weight function 
satisfies the conditions 

(1) w(x, v) = w(y, x) for all X, >> in the vertex set, and 

(2) w(x, y) + w(y, z) a w (x, for all x, j, z in the vertex set. 

The first condition is called the symmetry condition and the second is the triangle inequality. 
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1 

The following theorem's reference needs to be updated: ^^^^^ 

Theorem 9.5.1. If ( V, E, w) is a complete weighted graph that satisfies the symmetry and triangle inequality conditions, then 

Proof: See Liu, pages 105-109. 

Notes: (a) If | V | = 8, then this theorem says that Ccn can be no larger than twice the size of Copt; however, it doesn't say that the closest 

\\ae, (2 n)l C 

neighbor circuit will necessarily be that tar from an optimal circuit. The quantity — — ■ is called an upper bound for the ratio — ^. It tells us 

2 dpt 

only that things can't be any worse than the upper bound. Certainly, there are many graphs with eight vertices such that the optimal and closest 
neighbor circuits are the same. What is left unstated in this theorem is whether there are graphs for which the quantities in 9.5a are equal. If 

there are such graphs, we say that the upper bound is sharp. 

C [log (2 X 4)1 

(b) The value of in Example 9.5.4 is 1.6, which is greater than ^ = 1.5; however, the weight function in this example does not 

satisfy the conditions of the theorem. 

The Traveling Salesman Problem— Unit Square Version 

Example 9.5.5. A robot is programmed to weld joints on square metal plates. Each plate must be welded at prescribed points on the square. To 
minimize the time it takes to complete the job, the total distance that a robot's arm moves should be minimized. Let d(P, Q) be the distance 
between P and Q. Assume that before each plate can be welded, the arm must be positioned at a certain point Pq • Given a list of n points, we 
want to put them in order so that 

rf(Po,Pl) + diPuPl) +--+d(Pn-l,Pn)+ d(Pn,Po) 

is as small as possible. 

The type of problem that is outlined in Example 9.5.5 is of such importance that it is probably the most studied version of the Traveling 
Salesman Problem. What follows is the usual statement of the problem. Let [0, 1] = {x eR | <x< 1), and let S = [0, 1]^, the unit 

square. Given n pairs of real numbers (xi, yO, (X2, ^2) ix„, y„) in S that represent the n vertices of a K„ , find a circuit of the graph that 

minimizes the sum of the distances traveled in traversing the circuit. 

Since the problem calls for a circuit, it doesn't matter which vertex we start at; assume that we will start at (xi, yi). Once the problem is solved, 
we can always change our starting position. A function can most efficiently describe a circuit in this problem. Every bijection 
/ : {1, . . . , n} -» {1, . . . , n) with /(I) = 1 describes a circuit 

(Xl, Jl), (x^(2), Jy,,)), ...,(xy(„),y/(„)) 

Since there are (n - 1) ! such bijections, an examination of all possible circuits is not feasible for large values of n. 
One popular heuristic algorithm is the strip algorithm: 

Algorithm 9 J J: The Strip Algorithm. Given n points in the unit square: 
Phase 1: 

(1.1) Divide the square into |"V n/2 j vertical strips, as in Figure 9.5.4. Let d be the width of each strip. If a point lies on a boundary between 
two strips, consider it part of the left-hand strip. 

(1 .2) Starting from the left, find the first strip that contains one of the points. Locate the starting point by selecting the first point that is 

encountered in that strip as you travel from bottom to top. We will assume that the first point is (x\ , y\) 

(1.3) Alternate traveling up and down the strips that contain vertices until all of the vertices have been reached. 

(1.4) Return to the starting point. 
Phase 2: 

(2.1) Shift all strips d/2 units to the right (creating a small strip on the left). 

(2.2) Repeat Steps 1 .2 through 1 .4 of Phase 1 with the new strips. 

When the two phases are complete, choose the shorter of the two circuits obtained. 

Step 1 .3 needs a bit more explanation. How do you travel up or down a strip? In most cases, the vertices in a strip will be vertically distributed 
so that the order in which they are visited is obvious. In some cases, however, the order might not be clear, as in the third strip in Phase I of 

Figure 9.5.4. Within a strip, the order in which you visit the points (if you are going up the strip) is determined thusly: (x,-, >>;) precedes (x^, yjj 
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if y; < yj or if j, = yj and x; < xj . In traveling down a strip, replace < yj with > yj- 




Start Stnrl 



FIGURE 9.5.4 

The Strip Algorithm 

The selection of |"V'2/2 j strips was made in a 1959 paper by Beardwood, Halton, and Hammersley. It balances the problems that arise if the 

number of strips is too small or too large. If the square is divided into too few strips, some strips may be packed with vertices so that visiting 
them would require excessive horizontal motion. If too many strips are used, excessive vertical motion tends to be the result. An update on what 
is known about this algorithm is contained in the paper by K. J. Supowit, E. M. Reingold, and D. A. Plaisted. 

Since the construction of a circuit in the square consists of sorting the given points, it should come as no surprise that the strip algorithm 
requires a time that is roughly a multiple of n log n time units when n points are to be visited. 

The worst case that has been encountered with this algorithm is one in which the circuit obtained has a total distance of approximately ^ In 
(see Sopowit et al.). 



NETWORKS AND THE MAXIMUM FLOW PROBLEM 

Definition: Network. A network is a simple weighted directed graph that contains two distinguished vertices called the source and the 
sink with the property that the indegree of the source and outdegree of the sink are both zero. The weight function on a network is the capacity function. 



An example of a real situation that can be represented by a network is a city's water system. A reservoir would be the source, while a distribu- 
tion point in the city to all of the users would be the sink. The system of pumps and pipes that carries the water from source to sink makes up 
the remaining network. We can assume that the water that passes through a pipe in one minute is controlled by a pump and the maximum rate is 
determined by the size of the pipe and the strength of the pump. This maximum rate of flow through a pipe is called its capacity and is the 
information that the weight function of a network contains. 

Example 9.5.6. Consider the system that is illustrated in Figure 9.5.5. The numbers that appear next to each pipe indicate the capacity of 
that pipe in thousands of gallons per minute. This map can be drawn in the form of a network, as in Figure 9.5.6. 
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FIGURE 9.5.6 

Flow diagram for a city's water system 

The Maximum Flow Problem is derived from the objective of moving the maximum amount of water or other material from the source to the 
sink. To measure this amount, we define a flow as a function f : E ->R such that (1) the flow of material through any edge is nonnegative and 
no larger than its capacity: < /(e) < w (e), for all e e E; and (2) for each vertex other than the source and sink, the total amount of material 
that is directed into a vertex is equal to the total amount that is directed out: 

Z = E f(v,y) 

(x.v)eE (v.T)e£ (9.5b) 

Flow into V = Flow out of v 

The summation notation on the left of 9.5b represents the sum of the flows through each edge in E that has v as a terminal vertex. The right- 
hand side indicates that you should add all of the flows through edges that initiate at v. 

Theorem 9.52. Iff is a flow, then 

Z f (source, v) = J] f(v, sink) 

(source,v)iEE {v,sink)£E 



This common value is called the value of the flow. We will denote the value of a flow by V if). The value of a flow represents the amount of 
material that passes through the network with that flow. 

Proof. Subtract the right-hand side of 9.5b from the left-hand side. The result is: 

Flow into V - Flow out of v = 
Now sum up these differences for each vertex in V" = V - {source, sink). The result is 



Z E f(x,v)- E fiv,y) 
veV l,(.v,v)e£ (v,v)e£ 



(9.5c) 



Now observe that if an edge connects two vertices in V, its flow appears as both a positive and a negative term in 9.5c. This means that the only 
positive terms that are not cancelled out are the flows into the sink. In addition, the only negative terms that remain are the flows out of the 
source. Therefore, 

E f(v, sink) - E /(source, v) = ■ 

(v,smk)e£ {source,v)e£ 

MAXIMAL FLOWS 

Since the Maximum Flow Problem consists of maximizing the amount of material that passes through a given network, it is equivalent to 
finding a flow with the largest possible value. Any such flow is called a maximal flow. 

For the network in Figure 9.5.6, one flow is /i , defined by /i(ei) = 25, fiie2) = 20, fiej,) = 0, /i(e4) = 25, and fie^) = 20. The value of /i , 
V(/i), is 45. Since the total flow into the sink can be no larger than 50 (w (£4) -1- w (gj) = 30-1- 20), we can tell that /i is not very far from the 
solution. Can you improve on / at all? The sum of the capacities into the sink can't always be obtained by a flow. The same is true for the sum 
of the capacities out of the source. In this case, the sum of the capacities out of the source is 60, which obviously can't be reached in this 
network. 

A solution of the Maximum Flow Problem for this network is the maximal flow /2, where f2{ei) = 25, /2(e2) = 25, f2{e3) = 5, /2(e4) = 30, and 

72(^5) = 20, with V(j2) = 50. This solution is not unique. In fact, there is an infinite number of maximal flows for this problem. 

There have been several algorithms developed to solve the Maximal Flow Problem. One of these is the Ford and Fulkerson Algorithm (FFA). 
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The FFA consists of repeatedly finding paths in a network called flow augmenting paths until no improvement can be made in the flow that has 
been obtained. 

Definition: Flow Augmenting Path. Given a flow f in a network (V, E), a flow augmenting path with respect to f is a simple path from 
the source to the sink using edges both in their forward and their reverse directions such that for each edge e in the path, w(e) - f (e) > 
if e is used in its forward direction and f (e) > if e is used in the reverse direction. 

Example 9.5.7. For f in Example 9.5 .6, a flow augmenting path would be (e2 , £3 , £4 ) since 

w(e2) - /i(<?2) = 15, w(e3) - f(e3) = 5, and w(e^) - f(e4) = 5. 

These positive differences represent unused capacities, and the smallest value represents the amount of flow that can be added to each edge in 
the path. Note that by adding 5 to each edge in our path, we obtain /2, which is maximal. If an edge with a positive flow is used in its reverse 
direction, it is contributing a movement of material that is counterproductive to the objective of maximizing flow. This is why the algorithm 
directs us to decrease the flow through that edge. 

Algorithm 9.53: The Ford and Fulkerson Algorithm. 

(1) Define the flow function fg by fgie) = for each edge e e E. 

(2) i = 0. 

( 3 ) Repeat: 

(3.1) If possible, find a fiow augmenting path with respect to f . 

(3.2) If a flow augmenting path exists, then: 

(3.2.1) Determine 

d =min{{w(e) - fie) \ e is used in the forward direction], 
[fie) I e is used in the reverse direction]] 

(3.2.2) Define f+i by 

fMie) = f(e) 
fM(e)=f(e)+d 
fM (e)=f(e)-d 

(3.2.3) i = i+\. 
until no flow augmenting path exists. 

(4) Terminate with a maximal flow f 
Notes: 

(a) It should be clear that every flow augmenting path leads to a flow of increased value and that none of the capacities of the network can be 
violated. 

(b) The depth-first search should be used to find flow augmenting paths since it is far more efficient than the breadth-first search in this 
situation. The depth-first search differs from the broadcasting algorithm (a variation of the breadth-first search) in that you sequentially visit 
vertices until you reach a "dead end" and then backtrack. 

(c) There have been networks discovered for which the FFA does not terminate in a finite number of steps. These examples all have irrational 
capacities. It has been proven that if all capacities are positive integers, the FFA terminates in a finite number of steps. See Ford and Fulkerson, 
Even, or Berge for details. 

(d) When you use the FFA to solve the Maximum Flow Problem by hand it is convenient to label each edge of the network with the fraction 
fiie)/w(e). 

A Depth-First Search for the Sink Initiating at the Source. Let E ' be the set of directed edges that can be used in producing a flow 
augmenting path. Add to the network a vertex called start and the edge (start, source). 

(1) S = vertex set of the network. 

(2) p = start. 

(3) p = source (*Move p along the edge (start, source) *) 

(4) While p is not equal to start or sink do. 

If an edge in E ' exists that takes you from p to another vertex in S 
then set p to be that next vertex and delete the edge from E '. 
else reassign p to be the vertex that p was reached from (i.e., backtrack) . 

(5) If p = start, 

then no flow augmenting path exists. 

else p = sink, you have found a flow augmenting path. 

Example 9.5.8. Consider the network in Figure 9.5.7, where the current flow, /, is indicated by a labeling of the edges. The path 
(Source, vt , vj, V3 , Sink) is a flow augmenting path that allows us to increase the flow by one unit. Note that (v\, vj) is used in the reverse 
direction, which is allowed because /(vi, V3) > 0. The value of the new flow that we obtain is 8. This flow must be maximal since the capacities 



if e IS not part of the flow augmenting path 
if e is used in the forward direction 
if e is used in the reverse direction 
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out of the source add up to 8. This maximal flow is defined by the labeling of Figure 9.5.8. 




i'2 4/4 — 1'3 



FIGURE 9.5.7 

Current flow in Example 9.5.8 




V2 4/4 » 



FIGURE 9.5.8 

Augmented, optimal flow in Example 9.5.8 

OTHER GRAPH-OPTIMIZATION PROBLEMS 

(a) The Minimum Spanning Tree Problem: Given a weighted graph, {V , E, w), find a subset E' of E with the properties that (V, E') is 
connected and the sum of the weights of edges in E' are as small as possible. We will discuss this problem in Chapter 10. 

(b) The Minimum Matching Problem: Given an undirected weighted graph, (KE, w), with an even number of vertices, pair up the vertices so 
that each pair is connected by an edge and the sum of these edges is as small as possible. A unit square version of this problem has been studied 
extensively. See References: [SopowitOOl] for details on what is known about this version of the problem. 

(c) The Graph Center Problem: Given a connected, undirected, weighted graph, find a vertex (the center) in the graph with the property that 
the distance from the center to every other vertex is as small as possible. "As small as possible" could be interpreted either as minimizing the 
sum of the distances to each vertex or as minimizing the maximum distance from the center to a vertex. 

EXERCISES FOR SECTION 9.5 
A Exercises 

1. Find the closest neighbor circuit through the six capitals of New England starting at Boston. If you start at a different city, will you get a 
different circuit? 

2. Is Theorem 9.5.1 sharp for n = 3?Forn = 4? 

3. Given the following sets of points in the unit square, find the shortest circuit that visits all the points and find the circuit that is obtained with 
the strip algorithm. 

(a) ((0.1 /t, OAk) : k = Q, 1, 2, 10) 

(b) ((0.1, 0.3), (0.3, 0.8), (0.5, 0.3), (0.7, 0.9), (0.9, 0.1)) 

(c) ((0.0, 0.5), (0.5, 0.0), (0.5, 1.0), (1.0, 0.5)) 

(d) ((0, 0), (0.2, 0.6), (0.4, 0.1), (0.6, 0.8), (0.7, 0.5)) 

4. For« = 4, 5, and 6, locate n points in the unit square for which the strip algorithm works poorly. 

5. Consider the network whose maximum capacities are shown on the following graph. 
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c 2 — d 




a 3 »- — g 



(a) A function /is partially defined on the edges of this network by: 

/(Source, c) = / (Source, b) =/ (Source, a) = 2, and /(a, d) = \. 
Define /on the rest of the other edges so that /is a flow. What is the value of / 

(b) Find a flow augmenting path with respect to/for this network. What is the value of the augmented flow? 

(c) Is the augmented flow a maximum flow? Explain. 

6. Given the following network with capacity function c and flow function/, find a maximal flow function. The labels on the edges of the 
network are of the form f {e)/c{e), where c(e)is the capacity of edge e and / (e) is the used capacity for flow /. 



A 3/6 » ■ B 




7. Find maximal flows for the following networks. 

(a) 



a 20 - b 
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B Exercises 

8. (a) [Easy] Find two maximal flows for the network in Figure 9.5.6 other than the one found in the text. 

(b) [Harder] Describe the set of all maximal flows for the same network. 

(c) [Hardest] Prove that if a network has two maximal flows, then it has an infinite number of maximal flows. 

9. Discuss reasons that the closest neighbor algorithm is not used in the unit square version of the Traveling Salesman Problem. (Hint: Count 
the number of comparisons of distances that must be done.) 

C Exercises 

10. Explore the possibility of solving the Traveling Salesman Problem in the "unit box": [0, 1]' . 

1 1 . Devise a "closest neighbor" algorithm for matching points in the unit square. 
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I 9.6 Planarity and Colorings 

The topics in this section are related to how graphs are drawn. 

Planarity: Can a given graph be drawn in a plane so that no edges intersect? Certainly, it is natural to avoid intersections, but up to now we 
haven't gone out of our way to do so. 

Colorings: Suppose that each vertex in an undirected graph is to be colored so that no two vertices that are connected by an edge have the same 
color. How many colors are needed? This question is motivated by the problem of drawing a map so that no two bordering countries are colored 
the same. A similar question can be asked for coloring edges. 

Definition: Planar Graph/ Plane Graph/Planar Embedding. A graph is planar if it can be drawn in a plane so that no edges cross. If a 
graph is drawn so that no edges intersect, it is a plane graph, and such a drawing is a planar embedding of the graph. 

Example 9.6.1 . The graph in Figure 9.6.1(a) is planar but not a plane graph. The same graph is drawn as a plane graph in Figure 9.6.1(b) 




Figure 9.6.1 

A planar graph and a planar embedding 

Notes: 

(a) In discussing planarity, we need only consider simple undirected graphs with no self-loops. All other graphs can be treated as such since all 
of the edges that relate any two vertices can be considered as one "package" that clearly can be drawn in a plane. 

(b) Can you think of a graph that is not planar? How would you prove that it isn't planar? Proving the nonexistence of something is usually 
more difficult than proving its existence. This case is no exception. Intuitively, we would expect that sparse graphs would be planar and dense 
graphs would be nonplanar. Theorem 9.6.2 will verify that dense graphs are indeed nonplanar. 

(c) The topic of planarity is a result of trying to restrict a graph to two dimensions. Is there an analogous topic for three dimensions? What 
graphs can be drawn in one dimension? 

Answer to Note c: If a graph has only a finite number of vertices, it can always be drawn in three dimensions. This is not true for all graphs with 
an infinite number of vertices. The only "one-dimensional" graphs are the ones that consist of a finite number of chains, as in Figure 9.6.2, with 
one or more vertices in each chain. 




Figure 9.6.2 

Chains of length one, two and three 



Example 9.6.2. A discussion of planarity is not complete without mentioning the famous Three Utilities Puzzle. The object of the puzzle is to 
supply three houses. A, B, and C, with the three utilities, gas, electric, and water. The constraint that makes this puzzle impossible to solve is 
that no utility lines may intersect i. e., a planar embedding of the graph in Figure 9.6.3, which is commonly denoted Kjj. This graph is one of 
two fundamental nonplanar graphs. The Kuratowski Reduction Theorem states that if a graph is nonplanar then "contains" either a 3 or a ^5. 
Containment is in the sense that if you start with a nonplanar graph you can always perform a sequence of edge deletions and contractions 
(shrinking an edge so that the two vertices connecting it coincide) to produce one of the two graphs. 
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Figure 9.6.3 

The Three Utilities Puzzle. 

A planar graph divides the plane into one or more regions. Two points on the plane lie in the same region if you can draw a curve connecting 
the two points that does not pass through an edge. One of these regions will be of infinite area. Each point on the plane is either a vertex, a point 
on an edge, or a point in a region. A remarkable fact about the geography of planar graphs is the following theorem that is attributed to Euler. 

Theorem 9.6.1: Euler's Formula. If G = (V, E) is a connected planar graph with r regions, v vertices and e edges, then 
V + r - e = 2 (9.6a) 

Experiment: Jot down a graph right now and count the number of vertices, regions, and edges that you have. If v + r - 6' is not 2, then your 
graph is either nonplanar or not connected. 

Proof: We prove Euler's Formula by Induction on e, for e > 0. 
Basis: If ^ = 0, then G must be a graph with one vertex, v = 1; and there is one infinite region, r = I. 
Therefore, v + r - e = 1 + 1-0 = 2, and the basis is true. 

Induction: Suppose that G has k edges, k > I, and that all connected planar graphs with less than k edges satisfy 9.6a. Select any edge that is 
part of the boundary of the infinite region and call it ^i. Let G' be the graph obtained from G by deleting ei. Figure 9.6.4 illustrates the two 
different possibilities we need to consider: either G' is connected or it has two connected components, Gi and G2. 




Case 2; 




Figure 9.6.4 

Two case in the proof of Euler's Formula 

If G' is connected, the induction hypothesis can be applied to it. If G' has v' vertices, r' edges and e' edges, then v' + r' - e' = 2 and in 
terms of the corresponding numbers for G, 

V ' = V No vertices were removed to form G ' 

r' = r - 1 One region of G merged with the infinite region when ei is removed 
e' = k-l We assumed that G had hedges. 

For the case where G ' is connected, 

V + r - e = V + r - k 

= v' + (r' + I) - {e' + 1) 
= v' + r' - e' 
= 2 

If G' is not connected, it must consist of two connected components, Gi and G2 since we started with a connected graph, G. We can apply 
the induction hypothesis to each of the two components to complete the proof. We leave it to the students to do this, with the reminder that 
in counting regions, Gi and G2 will share the same infinite region. ■ 

Theorem 9.6.2. If G = (V, E) is a connected planar graph with v vertices, v > 3, and e edges, then 
e < 3v - 6. (9.6b) 
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Remark: One implication of 9.6b is that the number of edges in a connected planar graph will never be larger than three times its 
number .of vertices (as long as it has at least three vertices). Since the maximum number of edges in a graph with v vertices is a quadratic 
function of v, as v increases, planar graphs are more and more sparse. 

Outline of a Proof of Theorem 9.6.2. 

(a) Let r be the number of regions in G. For each region, count the number of edges that comprise its border. The sum of these counts must 
be at least 3r. Recall that we are working with simple graphs here, so a region made by two edges connecting the same two vertices is not 
possible. 

(b) Based on (a), infer that the number of edges in G must be at least y . 

(c) e > — => r < — 

(d) Substitute ^ for ^ in Euler's Formula to obtain an inequality that is equivalent to 9.6 .b. ■ 
The following theorem will be useful as we turn to graph coloring. 

Theorem 9.63. If G is a connected planar graph, then it has a vertex with degree 5 or less. 

Proof (by contradiction): We can assume that G has at least seven vertices, for otherwise the degree of any vertex is at most 5. Suppose 
that G is a connected planar graph and each vertex has a degree of 6 or more. Then, since each edge contributes to the degree of two vertices, 

e > — = 3 V. However, Theorem 9.6.2 states that the e < 3 v - 6 < 3 v, which is a contradiction. ■ 

2 




Figure 9.6.5 

A 3-coloring of Euler Island 

The map of Euler Island in Figure 9.6.5 shows that there are seven towns on the island. Suppose that a cartographer must produce a colored 
map in which no two towns that share a boundary have the same color. To keep costs down, she wants to minimize the number of different 
colors that appear on the map. How many colors are sufficient? For Euler Island, the answer is three. This problem motivates a more general 
problem. 

The Graph Coloring Problem. Given an undirected graph G = (V, E), find a "coloring function" / from V into a set of colors H such that 
(v,, vy) eE => f{vi) 4^ f{vi) and H has the smallest possible cardinality. The cardinality of H is called the chromatic number of G, x (G). 

Notes: 

(a) A coloring function onto an n element set is called an n-coloring. 

(b) In terms of this general problem, the chromatic number of the graph of Euler Island is three. To see that no more than three colors are 
needed, we need only display a 3-coloring: /(I) = /(4) = /(6) = blue, /(2) = red, and /(3) = /(5) = /(7) = white. This coloring is not 
unique. The next smallest set of colors would be of two colors, and you should be able to convince yourself that no 2-coloring exists for this 
graph. 

In the mid-nineteenth century, it became clear that the typical planar graph had a chromatic number of no more than 4. At that point, mathemati- 
cians attacked the Four-Color Conjecture, which is that if G is any planar graph, then its chromatic number is no more than 4. Although the 
conjecture is quite easy to state, it took over 100 years, until 1976, to prove the conjecture in the affirmative. 

Theorem 9.6.4: The Four-Color Theorem. If G is a planar graph, then x(Cj) < 4. 
A proof of the Four-Color Theorem is beyond the scope of this text, but we can prove a theorem that is only 25 percent inferior. 

Theorem 9.6.5: The Five-Color Theorem. If G is a planar graph, then ziG) ^ 5. 

The number 5 is not a sharp upper bound for xiG) because of the Four-Color Theorem. 
Proof, by Induction on the Number of Vertices in the Graph: 
Basis: Clearly, a graph with one vertex has a chromatic number of 1 . 

Induction: Assume that all planar graphs with n - \ vertices have a chromatic number of 5 or less. Let G be a planar graph. By Theorem 9.6.2, 
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there exists a vertex v with deg v < 5. Let G - v be the planar graph obtained by deleting v and all edges that connect v to other vertices in G. 
By the induction hypothesis, G - v has a 5-coloring. Assume that the colors used are red, white, blue, green, and yellow. 

If deg V < 5, then we can produce a 5-coloring of G by selecting a color that is not used in coloring the vertices that are connected to v with an 
edge in G. 

If degv = 5, then we can use the same approach if the five vertices that are adjacent to v are not all colored differently. We are now left with 
the possibility that vi , V2, V3, V4, and V5 are all connected to v by an edge and they are all colored differently. Assume that they are colored red, 
white blue, yellow, and green, respectively, as in Figure 9.6.6. 



Red 




Figure 9.6.6 

Starting at vi in G - v, suppose we try to construct a path V3 that passes through only red and blue vertices. This can either be accomplished or 
it can't be accomplished. If it can't be done, consider all paths that start at vi, and go through only red and blue vertices. If we exchange the 
colors of the vertices in these paths, including vi we still have a 5-coloring of G - v. Since vi is now blue, we can color the central vertex, v, 
red. 

Finally, suppose that vi is connected to V3 using only red and blue vertices. Then a path from vi to V3 by using red and blue vertices followed by 
the edges (V3, v) and (v, vi) completes a circuit that either encloses V2 or encloses V4 and V5 . Therefore, no path from V2 to V4 exists using only 
white and yellow vertices. We can then repeat the same process as in the previous paragraph with V2 and V4 , which will allow us to color v 
white. ■ 

Definition: Bipartite Graph. A bipartite graph is a graph that has a 2-coloring. Equivalently, a graph is bipartite if its vertices can be 
partitioned into two nonempty subsets so that no edge connects a vertices from the same from each subset. 

Example 9.6.3. 

(a) The graph of the Three Utilities Puzzle is bipartite. The vertices are partitioned into the utilities and the homes. Of course a 2-coloring of 
the graph is to color the utilities red and the homes blue. 

(b) For n > 1, the n-cube is bipartite. A coloring would be to color all strings with an even number of I's red and the strings with an odd 
number of I's blue. By the definition of the n-cube, two strings that have the same color couldn't be connected since they would need to differ 
in at least two positions. 

(c) Let 't'be a set of 64 vertices, one for each square on a chess board. We can index the elements of Vby 

V; j = the square on the row column j. 
Connect vertices in V according to whether or not you can move a knight from one square to another. Using our indexing of V, 

(v,„ v„)e£ifandonly.f | - ^ | • | ^ - / 1 = 2 

(V, E) is a bipartite graph. The usual coloring of a chessboard is valid 2-coloring. 

How can you recognize whether a graph is bipartite? Unlike planarity, there is a nice equivalent condition for a graph to be bipartite. 
Theorem 9.6.6. An undirected graph is bipartite if and only if it has no circuit of odd length. 

Proof. (=>) Let G = (V , E) be a bipartite graph that is partitioned into two sets, R(ed) and B(lue) that define a 2-coloring. Consider any circuit 
in V. If we specify a direction in the circuit and define/on the vertices of the circuit by 

f (u) = the next vertex in the circuit after V. 
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Note that /is a bijection. Hence the number of red vertices in the circuit equals the number of blue vertices, and so the length of the circuit must 
be even. 

(<=) Assume that G has no circuit of odd length. For each component of G, select any vertex w and color it red. Then for every other vertex v 
in the component, find the path of shortest distance from w to v. If the length of the path is odd, color v blue, and if it is even, color v red. We 
claim that this method defines a 2-coloring of G. Suppose that it does not define a 2-coloring. Then let v„ and be two vertices with identical 
colors that are connected with an edge. By the way that we colored G, neither v,, nor vi, could equal w. We can now construct a circuit with an 
odd length in G. First, we start at w and follow the shortest path to . Then follow the edge (v„, v^), and finally, follow the reverse of a 
shortest path from w to v;,. Since v„ and have the same color, the first and third segments of this circuit have lengths that are both odd or 
even, and the sum of their lengths must be even. The addition of the single edge (v„, v^) shows us that this circuit has an odd length. This 
contradicts our premise. ■ 

Note: An efficient algorithm for finding a 2-coloring of a graph can be designed using the method that is used in the second part of the 
proof above. 

EXERCISES FOR SECTION 9.6 
A Exercises 

1 . Apply Theorem 9.6.2 to prove that once n gets to a certain size, a K„ is nonplanar. What is the largest complete planar graph? 

2. Can you apply Theorem 9.6.2 to prove that the Three Utilities Puzzle can't be solved? 

3. What are the chromatic numbers of the following graphs? 

(a) (b) 




(d) 




(e) (f) 



1 7 




4. Prove that if an undirected graph has a subgraph that is a it then its chromatic number is at least 3. 

5. What is x(K„),n> 1? 

6. What is the chromatic number of the United States? 

B Exercises 

7 . Complete the proof of Theorem 9 .6 . 1 . 

8. Use the outline of a proof of Theorem 9.6.2 to write a complete proof. Be sure to point out where the premise v > 3 is essential. 

9. Let G = (V, E) with | V | > 11, and let U be the set of all undirected edges between distinct vertices in V. Prove that either G or 
G' = (V, is nonplanar. 

10. Design an algorithm to determine whether a graph is bipartite. 

1 1 . Prove that a bipartite graph with an odd number of vertices greater than or equal to 3 has no Hamiltonian circuit. 
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C Exercises 

12. Prove that any graph with a finite number of vertices can be drawn in three dimensions so that no edges intersect. 

13. Suppose you had to color the edges of an undirected graph so that for each vertex, the edges that it is connected to have different colors. 
How can this problem be transformed into a vertex coloring problem? 

14. (a) Suppose the edges of a K(, are colored either red or blue. Prove that there will be either a "red ^"3" (a subset of the vertex set with three 
vertices connected by red edges) or a "blue K^." 

(b) Suppose six people are selected at random. Prove that either there exists a subset of three of them with the property that any two people in 
the subset can communicate in a common language, or there exist three people, no two of whom can communicate in a common language. 
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I SUPPLEMENTARY EXERCISES FOR CHAPTER 9 

Section 9.1 

1 . Determine which two of the graphs below are isomorphic and give an explicit isomorphism between those two graphs. 




2. Teams 1, 2, 3, and 4 compete in a round-robin tournament. Draw a round-robin tournament graph that represents one of the possible out 
comes of the tournament. In terms of your graph, what was the outcome of the tournament (in wins and losses)? 

3. Let G = (V, E) be an undirected graph. An independent set, W, is a subset of V having the property that no two vertices in W are connected 
by an edge in E. That is, 

V, w e W => |v, w] ^ E. 

Finding a maximal independent set (an independent set that is as large as possible) is often of interest. 

(a) Find a maximal independent set in the graph of Figure 9.1 .2. 

(b) Prove that if W is a maximal independent set in G, then every vertex in G is connected by an edge to at least one vertex in W. 

(c) How large can a maximal independent set in a K„ be? 

4. Let 5 = 1 1, 2, n| be a set of n cities. Define a matrix A = [a,y] of order nxn by a,y = Oif ; = j\ otherwise a,y is the number of distinct ways 
of traveling directly from city / to city j by car, without visiting any other cities belonging to S en route. 

In the following diagram, the points represent five cities, 1, 2, 3, 4, and 5, and a line is drawn between two cities and labeled with a positive 
integer giving the number of direct routes (by car) between the respective cities. 




(a) Determine the 5x5 matrix A going with the above diagram. 

(b) Calculate 

(c) Interpret the meaning of the entries in A^, both for the result in part b, and for the general set 5 = (1, 2, n}. 
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5. An undirected graph can be used to model a map of states or countries, where there is a vertex for each country and an edge between two 
vertices if the two countries share a boundary. For example, the undirected graph of Central America (mainland only) would have eight 
vertices. {Honduras, Nicaragua} would be an edge of the graph, but {Mexico, Nicaragua} would not be an edge. 




(a) Draw the graph of Central America. 

(b) Find a path from Mexico to Panama. 

(c) What significance would this path have if you intended to drive from Mexico to Panama? 

6. Show that the graph of the states of Washington, Oregon, Idaho, Montana, Wyoming, and Colorado is isomorphic to the graph of New 
England. 

7. The outdegree of a vertex in a directed graph is the number of vertices in the graph that start at that vertex, and the indegree of the vertex is 
the number of vertices in the graph that terminate at that vertex. A vertex v in a tournament graph is a source if the indegree of that vertex is 
zero. A sink is a vertex in a tournament graph that has outdegree of zero. 

(a) Prove that a round-robin tournament graph can have at most one source and at most one sink. Interpret these facts in terms of the 
results of the tournament. 

(b) What is the outdegree of a source in a round-robin tournament graph? What is the indegree of a sink in a round-robin tournament 
graph? 

(c) Let G = ( V, £) be a round-robin tournament graph with | V| > 2. If p is "G has a sink," and q is "G has a source," prove that any one of 
the propositions p f\^q, p f\q, p f\^q and p f\q could be true. 

Section 9.2 

8. Let C = (V, £) be a round-robin tournament graph with | V| = «. If M is the matrix of G, 

(a) For / = 1 , 2, «, explain why the number of I's in column i of M plus the number of Is in row i is always equal to n - 1. 

(b) How many 1 s are there in M? 
Section 9.3 

9. Use the broadcasting algorithm to determine a shortest path from vertex a to vertex i in the following graph. List the depth sets. 
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10. In a breadth-first (broadcasting) search for a path from vertex A to vertex J, what would the depth sets (D] , Dj, ...) be? 




Section 9.4 

Definition: Randomly Eulerian. Graph G is randomly Eulerian from vertex v if every path in G that initiates at v and uses edges at random is a 
Eulerian circuit. 

11. Give examples of undirected graphs that are randomly Eulerian from none, one, two, or all vertices. It can be proven that no graph is 
randomly Eulerian from more than two vertices unless it is randomly Eulerian from all vertices. 

12. Prove that G is randomly Eulerian from v if and only if every circuit in G contains v. 

13. Ore's Theorem states that if G = {V, E) is an undirected graph with | V| = n s 3 such that {v, w] ^E ^ degu + deg v s n, then G has a 
Hamiltonian circuit. Prove Ore's Theorem given the following out line. 

Proof by contradiction: 

(a) Add edges to E so that G still has no Hamiltonian circuit, but so that the addition of any other edge does produce a Hamiltonian 
circuit. Now select any |v, m') ^ E. There must be a path in G: 

V = (Vi, V2, V„) = W. 

(b) Prove that for 2^i tin, {vi , v,) e £ => |v;_i , v„) ^ E. 

(c) Conclude that deg v + deg w <n. 

Section 9.5 

14. Given: 



b 7 d 




(a) Determine the optimal path from the vertex a to the vertex z- 

(b) Use the broadcasting algorithm to find a path from ato z- 

(c) Find the "costs" of both paths and discuss whether the algorithm in part b gives a "good result." 
15 . Consider the network 
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We start with a flow/on this network partially defined by f{s, c) = f(s, b) = f(s, a) = I and/(rf, a) = 0. 

(a) Define /for each of the other edges in this network so as to achieve a flow. 

(b) Assume that the maximum capacity of each edge is 2. Determine a flow augmenting path with respect to the flow given in part a. 
(Use the Ford and Fulkerson Algorithm.) 

(c) Is the flow obtained in part b a maximal flow? Explain. 
16. Find a maximal flow for the following network: 




d ae — — e ts — - — / 



17. (a) Given the mileage chart below, what is the closest neighbor circuit starting at city A? 





A 


B 


C 


D 


E 


F 


A 




34 


50 


12 


51 


37 


B 


34 




40 


27 


70 


60 


C 


50 


40 




30 


33 


41 


D 


12 


27 


30 




39 


20 


E 


51 


70 


33 


39 




15 


F 


37 


60 


41 


20 


15 





(b) Suppose it takes {k - 1) seconds to determine which of k cities is closest to any given city. Estimate how long it would take to find a 
closest neighbor circuit through n cities. 

Section 9.6 

18. Draw three connected graphs, G2, G3, and G4, with four vertices, each having chromatic numbers 2, 3, and 4 (i.e. c(G;) = /. 

19. (a) Prove that although a is not planar, it can be drawn on a torus (a doughnut- shaped surface). 

(b) Can the Three Utilities Puzzle be solved on a torus? 

20. Draw examples of the following graphs, if possible. Clearly indicate which of the graphs are impossible and why they are impossible. You 
may cite any theorem that justifies your claim. 

(a) An undirected Hamiltonian graph that is not Eulerian. Explain why your graph is not Eulerian. 

(b) A bipartite Hamiltonian graph with five vertices. 

(c) A round-robin tournament graph with four vertices. 

(d) A connected planar graph with four vertices, three regions, and six edges. 

(e) An undirected graph with chromatic number five. 
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GOALS 

In this chapter we will study the class of graphs called trees. Trees are frequently used in both mathematics and the sciences. Our solution of 
Example 2.1 is one simple instance. Since they are often used to illustrate or prove other concepts, a poor background in trees can be a serious 
handicap. For this reason, our ultimate goals are to: (1) define the various common types of trees, (2) identify some basic properties of trees, 
and (3) discuss some of the common applications of trees. 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-StiareAlilce Works 3.0 United States License. 



Chapter 10 - Trees 

I 10.1 What Is a Tree? 

What distinguishes trees from other types of graphs is the absence of certain paths called cycles. Recall that a path is a sequence of consecutive 
edges in a graph; and a circuit is a path that begins and ends at the same vertex. 

Definition: Cycle. A cycle is a circuit whose edge list contains no duplicates. 

The simplest example of a cycle in an undirected graph is a pair of vertices with two edges connecting them. Since trees are cycle-free, we can 
rule out all multigraphs from consideration as trees. 

Trees can either be undirected or directed graphs. We will concentrate on the undirected variety in this chapter. 

Definition: Tree. An undirected graph is a tree if it is connected and contains no cycles or self-loops. 
Example 10.1.1. 

(a) Graphs i, ii and iii in Figure 10.1 .1 are all trees, while graphs iv, v, and vi are not trees. 

(i) (iii) 





Figure 10.1.1 

Some trees and non-trees 

(b) A ^2 is a tree. However, if « > 3, a ^T,, is not a tree. 

(c) In a loose sense, a botanical tree is a mathematical tree. There are usually no cycles in the branch structure of a botanical tree. 

(d) The structures of some chemical compounds are modeled by a tree. For example, butane (Figure 10.1 .2a) consists of four carbon molecules 
and ten hydrogen molecules, where an edge between two molecules represents a bond between them. A bond is a force that keeps two 
molecules together. The same set of molecules can be linked together in a different tree structure to give us the compound isobutane (Figure 
10.1.2b). There are some compounds whose graphs are not trees. One example is benzene (Figure 10.1.2c). 



(b) (c) 



H H H H V H V- C = C 

^C^ H H C H / \ 

/ \ ^ \ / I H-C C-H 



H "C^ 

""h u " H-C-H 

I 

H 



Figure 10.1.2 

Simple organic compounds 

One type of graph that is not a tree, but is closely related, is a forest. 

Definition: Forest. A forest is an undirected graph whose components are all trees. 



C C 

/ \ 
H H 
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Example 10.1.2. The top half of Figure 10.1 .1 can be viewed as a forest of three trees. 

We will now examine several conditions that are equivalent to the one that defines a tree. The following theorem wiU be used as a tool in 
proving that the conditions are equivalent. 

Theorem 10.1.1. Let G = (V, E) be an undirected graph with no self-loops, and let Vq, v^, e V. If two different simple paths exist between 
Vo and Vj, , then there exists a cycle in G. 

Proof: Let pi = (ei, e2, .... e„) and p2 = ifi, fi /«) be two different simple paths from Va to Vj. The first step we wiU take is to 

delete from p\ and p2 the initial edges that are identical. That is, if ei = /i , e2 = fi, ■■■,ej = fj, and ej+i i= f+i delete the first j edges of both 
paths. Once this is done, both paths start at the same vertex, call it Vc , and both stiU end at Vj, . Now we construct a cycle by starting at Vc and 
following what is left of pi until we first meet what is left of p2 • If this first meeting occurs at vertex , then the remainder of the cycle is 
completed by following the portion of the reverse of p2 that starts at vj and ends at Vc . ■ 

Theorem 10.1 J. Let G = (V, E) be an undirected graph with no self-loops and | V | = n. The following are all equivalent: 

(1) Gis a tree. 

(2) For each pair of distinct vertices in V, there exists a imique simple path between them. 

(3) G is connected, and if e e £, then (V, E - {e}) is discormected. 

(4) G contains no cycles, but by adding one edge, you create a cycle. 

(5) C is connected and \ E \ = n - I. 

Proof Strategy. Most of this theorem can be proven by proving the following chain of implications; (1) => (2), (2) => (3), (3) => (4), and 
(4) => (1). Once these implications have been demonstrated, the transitive closure of on {1 , 2, 3, 4} establishes the equivalence of the first 
four conditions. The proof that Statement 5 is equivalent to the first four can be done by induction, which we will leave to the reader. 

Proof (1) (2) (Indirect). Assume that G is a tree and that there exists a pair of vertices between which there is either no path or there are at 
least two distinct paths. Both of these possibihties contradict the premise that G is a tree. If no path exists, G is discormected, and if two paths 
exist, a cycle can be obtained by Theorem 10.1.1. 

(2) => (3). We now use Statement 2 as a premise. Since each pair of vertices in V are connected by exactly one path, G is connected. Now if we 
select any edge e in E, it connects two vertices, vi and V2 . By (2), there is no simple path connecting vi to V2 other than e. Therefore, no path at 
all can exist between vi andv2in(V, E - [e]). Hence (V, E - {e}) is discormected. 

(3) => (4). Now we will assume that Statement 3 is true. We must show that G has no cycles and that adding an edge to C creates a cycle. We 
will use an indirect proof for this part. Since (4) is a conjunction, by DeMorgan's Law its negation is a disjunction and we must consider two 
cases. First, suppose that C has a cycle. Then the deletion of any edge in the cycle keeps the graph connected, which contradicts (3). The second 
case is that the addition of an edge to G does not create a cycle. Then there are two distinct paths between the vertices that the new edge 
coimects. By Theorem 10.1 .1 , a cycle can then be created, which is a contradiction. 

(4) => (1) Assume that G contains no cycles and that the addition of an edge creates a cycle. AH that we need to prove to verify that G is a tree 
is that G is connected. If it is not connected, then select any two vertices that are not connected. If we add an edge to connect them, the fact that 
a cycle is created implies that a second path between the two vertices can be found which is in the original graph, which is a contradiction. ■ 

The usual definition of a directed tree is based on whether the associated undirected graph, which is obtained by "erasing" its directional arrows, 
is a tree. In Section 10.3 we will introduce the rooted tree, which is a special type of directed tree. 

EXERCISES FOR SECTION 10.1 
A Exercises 

1 . Given the following vertex sets, draw all possible undirected trees that coimect them. 

(a) Va = {right, left} 

(b) V, = {+, -, 0} 

(c) Vc = {north, south, east, west). 

2. Are all trees planar? If they are, can you explain why? If they are not, you should be able to find a nonplanar tree. 

3. Prove that if G is a simple undirected graph with no self-loops, then G is a tree if and only if G is coimected and \E\ = 

4. (a) Prove that if G = (V, E) is a tree and e e E, then {V, E - {e}) is a forest of two trees. 

(b) Prove that ifCVi.iii) and (V2, E2) are disjoint trees and e is an edge that connects a vertex in Vi to a 
(Vi U V2, £1 U £2 U [e]) is a tree. 

5. (a) Prove that any tree with at least two vertices has at least two vertices of degree 1. 
(b) Prove that if a tree is not a chain, then it has at least three vertices of degree 1 . 



I VI- 1 . 

vertex in V2 , then 
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I 10.2. Spanning Trees 

The topic of spanning trees is motivated by a graph-optimization problem. 

Example 10.2.1. A map of Atlantis (Figure 10.2.1) shows that there are four campuses in its university system. A new secure communications 
system is being installed and the objective is to allow for communication between any two campuses, to achieve this objective, the university 
must buy direct lines between certain pairs of campuses. Let G be the graph with a vertex for each campus and an edge for each direct line. 
Total communication is equivalent to G being a connected graph. This is due to the fact that two campuses can communicate over any number 
of lines. To minimize costs, the university wants to buy a minimum number of lines. 



North 



West 



East 



South 



Figure 10.2.1 

The Atlantis University System 

The solutions to this problem are all trees. Any graph that satisfies the requirements of the university must be connected, and if a cycle does 
exist, any line in the cycle can be deleted, reducing the cost. Each of the sixteen trees that can be drawn to connect the vertices North, South, 
East, and West (see Exercise Ic of Section 10.1) solves the problem as it is stated. Note that in each case, three direct lines must be purchased. 
There are two considerations that can help reduce the number of solutions that would be considered. 

Objective 1: Given that the cost of each line depends on certain factors, such as the distance between the campuses, select a tree whose cost is 
as low as possible. 

Objective 2: Suppose that communication over multiple lines is noisier as the number of lines increases. Select a tree with the property that the 
maximum number of lines that any pair of campuses must use to communicate with is as small as possible. 

Typically, these objectives are not compatible; that is, you cannot always simultaneously achieve these objectives. In the case of the Atlantis 
university system, the solution with respect to Objective 1 is indicated with solid lines in Figure 10.2.1. There are four solutions to the problem 
with respect to Objective 2: any tree in which one campus is directly connected to the other three. One solution with respect to Objective 2 is 
indicated with dotted lines in Figure 10.2.1 . After satisfying the conditions of Objective 2, it would seem reasonable to select the cheapest of the 
four trees. 

Definition: Spanning Set, Spanning Tree. 

(a) Let G = (V, E) be a connected undirected graph. A spanning set for G is a subset E' of E such that {V, E') is connected. 

(b) IfE' is a spanning set for G and T = {V, E') is a tree, then T is called a spanning tree for G. 

Notes 

(a) If (V, £') is a spanning tree, \E'\ =\'V\ - 1. 

(b) The significance of a spanning tree is that it is a minimal spanning 

set. A smaller set would not span the graph, while a larger set would have a cycle, which has an edge that is superfluous. 

For the remainder of this section, we will discuss two of the many topics that relate to spanning trees. The first is the Minimal Spanning Tree 
Problem, which addresses Objective 1 above. The second is the Minimum Diameter Spanning Tree Problem, which addresses Objective 2. 

THE MINIMAL SPANNING TREE PROBLEM 

Given a weighted connected undirected graph G = (V , E, w) , the minimal spanning tree problem is to find a spanning tree (V, E') for which 
Y, w{e) is as small as possible. 

Unlike many of the graph-optimization problems that we've examined, a solution to this problem can be obtained efficiently. It is a situation in 
which a greedy algorithm works. 

Definition: Bridge. Let G = ( V , E) be an undirected graph and let [L, R] be a partition ofV. A bridge between L and R is an edge in E 
that connects a vertex in Lto a vertex in R. 
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Theorem 10.2.1. Let G = (V, E, w) be a weighted connected undirected graph. Let V be partitioned into two sets L and R. If e* is a 
bridge of least weight between L and R, then there exists a minimal spanning tree for G that includes e* . 

Proof (Indirect proof): Suppose that no minimum spanning tree including e* exists. Let T = (V, E') be a minimum spanning tree. If we 
add e* to T, a cycle is created, and this cycle must contain another bridge, e, between L and R. Since w(e*) < w(e), we can delete e and the 
new tree, which includes e* must also be a minimum spanning tree. ■ 

Example 10.2.2. The bridges between the vertex sets {a, b, c] and [d, e] in Figure 10.2.2 are the edges \b, d] and |c, e). According to 
the theorem, a minimal spanning tree that includes \b, d] exists. By examination, you should be able to see that this is true. Is it true that only 
the bridges of minimal weight can be part of a minimal spanning tree? The answer is no. 




Figure 10.2.2 

Theorem 10.2.1 essentially tells us that a minimal spanning tree can be constructed recursively by continually adding minimally weighted 
bridges to a set of edges. 

Algorithm 102.1: Prim's Algorithm for Finding a Minimal Spanning Tree. Let G = (V, E, w) be a connected, weighted, undirected 
graph, and let Vg be an arbitrary vertex in V. The following steps lead to a minimal spanning tree for G. L and R will be sets of vertices and 
E ' is a set of edges. 

(1) (Initialize) L = V - [vg]; R = [vg]; E' = 0. 

(2) (Build the tree) While L =f= 

(2.1) Find e* = [v^, v^], a bridge of minimum weight between L and R. 

(2.2) R = R \J [vl): L = L - |vz.) ; £' .■=E'[J [e'] 

(3) Terminate with a minimal spanning tree (V , E'). 



Notes: 

(a) If more than one minimal spanning tree exists, then the one that is obtained depends on vo and the means by which e* is selected in Step 
2.1 if two minimally weighted bridges exist. 

(b) Warning: If two minimally weighted bridges exist between L and R, do not try to speed up the algorithm by adding both of them to E . 

(c) That Algorithm 10.2.1 yields a minimal spanning tree can be proven by induction with the use of Theorem 10.2.1 . 

(d) If it is not known whether G is connected. Algorithm 10.2.1 can be revised to handle this possibility. The key change (in Step 2.1) would 
be to determine whether any bridge at all exists between L and R. The condition of the while loop in Step 2 must also be changed somewhat. 

Example 10.2.3. Consider the graph in Figure 10.2.3. If we apply Algorithm 10.2.1 starting at a, we obtain the following edge list in the 
order given: [a, f\, {/, e\, [e, c|, (c, d], {/, b\, [b, g]. The total of the weights of these edges is 20. The method that we have used (in 
Step 2.1) to select a bridge when more than one minimally weighted bridge exists is to order all bridges alphabetically by the vertex in L and 
then, if further ties exist, by the vertex in R. The first vertex in that order is selected in Step 2.1 of the algorithm. 
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Figure 10.2.3 

The Minimum Diameter Spanning Tree Probiem 

Given a connected undirected graph G = (V, E), find a spanning tree T = (V, E') of G such that the longest path in T is as short as possible. 

Example 10.2.4. The Minimum Diameter Spanning Tree Problem is easy to solve in a K„. Select any vertex vq and construct the spanning tree 
whose edge set is the set of edges that connect vq to the other vertices in the K„ . Figure 10.2.4 illustrates a solution for « = 5 . 




Minimum diameter spanning tree for 

For incomplete graphs, a two-stage algorithm is needed. In short, the first step is to locate a "center" of the graph. The maximum distance from 
a center to any other vertex is as small as possible. Once a center is located, a breadth-first search of the graph is used to construct the spanning 
tree. The breadth-first search is essentially the broadcasting algorithm that we discussed in Section 9.3 applied to undirected graphs. 

EXERCISES FOR SECTION 10.2 
A Exercises 

1. Suppose that after Atlantis University's phone system is in place, a fifth campus is established and that a transmission line can be bought to 
connect the new campus to any old campus. Is this larger system the most economical one possible with respect to Objective 1? Can you always 
satisfy Objective 2? 

2. Construct a minimal spanning tree for the capital cities in New England (see Figure 9.5.1). 

3. Show that the answer to the question posed in Example 10.2.2 is "no." 

4. Find a minimal spanning tree for the following graphs. 
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5 . Find a minimum diameter spanning tree for the following graphs. 

(a) 

11 12 13 14 15 



6 7 8 9 10 



1 2 3 4 5 



(b) 




6. In each of the following parts back up your answer with either a proof or an counterexample. 

(a) Suppose a weighted undirected graph had distinct edge weights. Is is possible that no minimal spanning tree includes the edge of minimal 
weight? 

(b) Suppose a weighted undirected graph had distinct edge weights. Is is possible that every minimal spanning tree includes the edge of 
maximal weight? 



1 10.3 Rooted Trees 

In the next two sections, we wiU discuss rooted trees. Our primary focuses wiU be on general rooted trees and on a special case, ordered binary 
trees. 

Informal Definition and Terminology: What differentiates rooted trees from undirected trees is that a rooted tree contains a distinguished 

vertex, called the root. Consider the tree in Figure 10.3.1 . Vertex A has been designated the root of the tree. If we choose any other vertex in the 
tree, such as M, we know that there is a unique path from A to M. The vertices on this path, (A, D, K, M), are described in genealogical terms: 

M is a child of K (so is L). 

iTisAfs parent. 

A, D, and K are Af s ancestors. 
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D, K, and M are descendants of A. 



M 






I 







FIGURE 10.3.1 

Rooted tree 

These genealogical relationships are often easier to visualize if the tree is rewritten so that children are positioned below their parents, as in 
Figure 10.3.2. 

With this format, it is easy to see that each vertex in the tree can be thought of as the root of a tree that contains, in addition to itself, all of its 
descendants. For example, D is the root of a tree that contains D, K, L, and M. Furthermore, K is the root of a tree that contains K, L, and M. 
Finally, L and M are roots of trees that contain only themselves. From this observation, we can give a formal definition of a rooted tree. 



Root 




M 



FIGURE 10.3.2 

Rooted tree of Figure 10.3.1 , redrawn 

Definition: Rooted Tree. 

(a) Basis: 

(i) A tree with no vertices is a rooted tree (the empty tree), 
( ii) A single vertex with no children is a rooted tree 

(b) Recursion: 

Let Ti, T2, ... , Tr, r > 1 , be disjoint rooted trees with roots vi, V2 Vr, respectively, and let Vg be a vertex that does not 

belong to any of these trees. Then a rooted tree, rooted at Vg, is obtained by making Vg the parent of the vertices v\, V2, and 
Vr- We call T\, T2, ... , T,-, subtrees of the larger tree. 

The level of a vertex of a rooted tree is the number of edges that separate the vertex from the root. The level of the root is zero. The depth of a 
tree is the maximum level of the vertices in the tree. The depth of a tree in Figure 10.3.2 is three, which is the level of the vertices L and M. The 
vertices E, F, G, H, I, J, and K have level two. B, C, and D are at level one and A has level zero. 
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Example 103.1. Figure 2.1 .1, which we reproduce below is a rooted tree with Start as the root. It is an example of what is called a decision tree. 




Figure 2.1 .1 

Example 103.2: Data Structm-es. One of the keys to working with large amounts of information is to organize it in a consistent, logical way. 
A data structure is a scheme for organizing data. A simple example of a data structure might be the information a college admissions depart- 
ment might keep on their applicants. Items in a flat file might look something like this: 

Applicantltem = (FirstName, Middlelnitial, LastName, StreetAddress, City, State, Zip, HomePhone, CellPhone, EmailAddress, 
HighSchool, Major, ApplicationPaid, MathSAT, VerbalSAT, Recommendation 1, Recomraendation2, Recommendations) 

A spreadsheet can be used to arrange data in this way. Although a flat file structure is often adequate, there are often advantages to clustering 
some the information. For example the applicant information is broken into four parts: name, contact information, high school, and application 
data. 

Applicantltem = {{FirstName, Middlelnitial, LastName), {{StreetAddress, City, State, Zip), {HomePhone, CellPhone), EmailAddress), 

HighSchool, {Major, ApplicationPaid, {MathSAT, VerbalSAT), {Recommendation 1, Recommendation2, Recommendations))) 

The first item in the Applicantltem list is a Ust {FirstName, Middlelnitial, LastName}, with each item in that list being a single field of the 
original flat file. The third item is simply the single high school item from the flat file. The application data is a list and one of its items, is 

itself a list with the recommendation data for each recommendation the applicant has. 

The organization of this data can be visualized with a rooted tree such as the one in Figure 10.3.3. 
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Figure 10.3.3 

Structured Data 

In general, you can represent a data item, T, as a rooted tree with T as the root and a subtree for each field. Those fields that are more than just 
one item are roots of further subtrees, while individual items have no further children in the tree. 

KRUSKAL'S ALGORITHM 

An alternate algorithm for constructing a minimal spanning tree uses a forest of rooted trees. First we will describe the algorithm in its simplest 
terms. Afterward, we will describe how rooted trees are used to implement the algorithm. Finally, we will describe a simple data structure and 
operations that make the algorithm quite easy to program. In all versions of this algorithm, assume that G = (V, E, w) is a weighted undi- 
rected graph with \ V\ = m and \E\ = n. 



KruskaVs Algorithm— Version 1. 

( 1 ) Sort the edges of G in ascending order according to weight. That is, 

i < i <=> w[ej) < w[ej). 

(2) Go down the list obtained in Step 1 and add edges to a set (initially empty) of edges so that the set does not form a cycle. When an edge 
that would create a cycle is encountered, ignore it. Continue examining edges until either m - 1 edges have been selected or you have come 
to the end of the edge list. If m - I edges are selected, these edges make up a minimal spanning tree for G. If fewer than m - I edges are 
selected, G is not connected. 



Note: Step I can be accomplished using one of any number of standard sorting routines. Using the most efficient sorting routine, the time 
required to perform this step is proportional to n log «. The second step of the algorithm, also of n log n time complexity, is the one that uses a 
forest of rooted trees to test for whether an edge should be added to the spanning set. 



KruskaVs Algorithm— Version 2. 

(1) Sort the edges as in Version 1 . 

(2) (2.1) Initialize each vertex in V to be the root of its own rooted tree. 

(2.2) Go down the list of edges until either a spanning tree is completed or the edge list has been exhausted. For each edge 
e = {vj, V2), we can determine whether e can be added to the spanning set without forming a cycle by determining whether the root 
ofvj's tree is equal to the root of V2' s tree. If the two roots are equal, then ignore e. If the roots are different, then we can add e to 
the spanning set. In addition, we merge the trees that Vi and V2 belong to. This is accomplished by either making V\ ' s root the 
parent 0/V2 ' s root or vice versa. 



Notes: 

(a) Since we start the algorithm with m trees and each addition of an edge decreases the number of trees by one, we end the algorithm 
with one rooted tree, provided a spanning tree exists. 

(b) The rooted tree that we develop in the algorithm is not the spanning tree itself. 
Mathematica Implementation of Kruskal's Algorithm 

We will implement Kruskal's Algorithm with Mathematica. First we will describe a very simple data structure for representing the forest of 
trees that is maintained within the algorithm. All that is needed is a list of integers, one for each of the vertices in the graph. For simplicity we 
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will take the vertices to be the integers 1 , 2, 

if jiik' s parent in the forest 
if is the root of a tree in the forest 



n and will illustrate with the case of n = 10 . If forest is the list, then 



forest[ [k] ] 



lO 



For example, if forest is equal to (0, 1, 0, 6, 3, 3, 4, 0, 3, 3), the vertices 1, 3, and 8 are roots of trees. Figure 10.3.4 shows the forest, 
where directed edges are used to indicate parent hood. An edge from k to j indicates that j is ^'s parent. Roots are sinks in this representation 
of trees. 



V 




By representing forests in this way, it is easy to program functions that give the root of a tree and merge two trees. Notice the recursive nature 
of the root function. The input cell below has been set to be not evaluatable. These functions will be defined locally within the Kruskal 
algorithm code. 

root[k_] := If [forest[ [k] ] == 0, k, root [forest [ [k] ]]] ; 
merge[vl_, v2_] ;= (f orest [ [root [v2 ] ] ] =root[vl]); 

Example 10.3.3. One way to represent a weighted graph in Mathematica is as a list of pairs {edge, weight) where the edge is a rule. For 
example, {1 •— 3, 6) would be a edge connecting vertices 1 and 3 with weight 6. For consistency, we seed the random number generator and 
then generate a random graph, wg, having 10 vertices by "flipping a coin" to determine whether each edge is present. We also assign a assign 
random weight between 4 and 10 to each edge. These numbers are arbitrary and can be adjusted, if desired. 

Here is the Mathematica code that generates the graph 
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SeedRandom[2012] ; 
n = 10; 
edges = 

Map[UndirectedEdge@@0 &, Subsets [Range [10] , {2}] // Select[n, RandomReal [ ] £ 1 / 2 &] &] ; 
e = Length [edges] ; 

weights = Randomlnteger [ {4 , 10}, e] ; 

graph = Graph [MapThread [Labeled [nl , R2] &, {edges, weights}]] 



edgelist = SortBy [Transpose [ {edges , weights}]. Last] 



( ^ " 


-2 


4 


2^ 


- 8 


4 


3^ 


- 8 


4 


4- 


-7 


5 


6- 


- 8 


5 


1 ► 


- 3 


6 


1 ► 


-7 


6 


4- 


-5 


6 


5 - 


- 10 


6 


2- 


- 10 


7 


4- 


-6 


7 


2^ 


-4 


8 


1 ► 


- 8 


9 


1 ► 


-9 


9 


5 ► 


-6 


9 


1 - 


- 10 


10 


2- 


-9 


10 


4- 


-9 


10 


5 ► 


-9 


10 


. 6- 


-9 


10 , 



The following function presumes that the data from a weighted undirected graph is in the {edge, weight) format described above and called 
edgelist. The output is a sublist that comprises a minimal spanning tree for the input. 
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Kruskal [g_] : = 

Module[{nV, vertexset, spanset, edgepool, forest, nextedge, root, merge, index}, 
root[k_] := If [forest [ [k] ] == 0, k, root [ forest [ [k] ]]] ; 
merge [vl_, v2_] ;= (forest [ [root [v2] ] ] =root[vl]); 

vertexset = Union [Flatten [ (List @@tt) &/@g[[All, 1]]]]; nV = Length [vertexset] ; 
Map [ (index [vertexset [ [tt] ] ] = tt) &, Range [nV]]; 

forest = Table [0, {nV}] ; spanset = {}; edgepool = SortBy [g. Last] ; 
While [ (Length [spanset] < nV - 1) && (edgepool # {}) , 
nextedge = First [edgepool] ; edgepool = Rest [edgepool] ; 

I f [ root [ index [nextedge [ [ 1 , 1 ] ] ] ] ^ root [ index [nextedge [ [ 1 , 2 ] ] ] ] , AppendTo [ 

spanset , nextedge ] ; merge [ index [nextedge [ [ 1 , 1 ] ] ] , index [nextedge [[1, 2]]]]]]; 
If [Length [spanset] ==nV-l, spanset, "Graph not connected - no spanning tree exists."]] 

We use the function to generate a spanning tree for our example: 

St = Kruskal [edgelist] 
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Lets take a look at the spanning tree, with tree edges colored blue and the unused edges colored red. 




Note: The Mathematica code to ttiat displays this and other spanning trees is not so instructive to the general reader. If you are interested in seeing the code, it is 
available as part of the package ADS2.m. See the text's web page for download information. The function name is KruskalGraph. 

Example 10.3.4. Lets look at how long it take to complete the algorithm for a graph with 200 edges. Drawing the graph isn't of interest, 
it's mostly the time to generate the spanning tree we are interested in. 

SeedRandom[2010] ; 

largegraph = Map [ {UndirectedEdge @@ n, Randomlnteger [ { 4, 10}]} &, 

RandomGraph [BernoulliGraphDistribution [200, 0.5]] // EdgeList // 
Select [tt, (First [tt] < Last [tt] ) &] &] ; 

The time will vary depending on the computer you are using. The expression First [Tiining[caIculation;]] will perform a calculation and 
just display the CPU time needed, in seconds, to complete the calculation. 

{First [Timing [largetree = Kruskal [largegraph] ; ] ] , $System} 

(0.564645, Mac OS X x86 (64-bit)) 
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Just for fun, here is what the spanning tree looks like. 
KruskalGraph [largetree. Blue] 




Example 10.3.5. Lets conclude with an example that is based on some real data, using Wolfram's Computable database of city data. 
We will build a minimal spanning tree for the large cities of France, where "large" is taken to mean a population of 100,000 or more. The 
weights of edges between cities will the distance between them. First here is that list of cities. By the way, you can change "France" to any 
other country or region and get similar results. 
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citylist = CltyData[ {Large, "France"}] 



Paris 


IleDeFrance 


France ^ 


Marseille 


ProvenceAlpesCoteDAzur 


France 


Lyon 


RhoneAlpes 


France 


Toulouse 


MidiPyrenees 


France 


Nice 


ProvenceAlpesCoteDAzur 


France 


Nantes 


PaysDeLaLoire 


France 


Strasbourg 


Alsace 


France 


Montpellier 


LanguedocRoussillon 


France 


Bordeaux 


Aquitaine 


France 


LiUe 


NordPasDeCalais 


France 


Rennes 


Bretagne 


France 


LeHavre 


HauteNormandie 


France 


Reims 


ChampagneArdenne 


France 


SaintEtienne 


RhoneAlpes 


France 


Toulon 


ProvenceAlpesCoteDAzur 


France 


Grenoble 


RhoneAlpes 


France 


Angers 


PaysDeLaLoire 


France 


Dijon 


Bourgogne 


France 


Brest 


Bretagne 


France 


LeMans 


PaysDeLaLoire 


France 


Nimes 


LanguedocRoussillon 


France 


AixEnProvence 


ProvenceAlpesCoteDAzur 


France 


Limoges 


Limousin 


France 


ClermontFerrand 


Auvergne 


France 


Tours 


Centre 


France 


Amiens 


Picardie 


France 


Villeurbanne 


RhoneAlpes 


France 


Metz 


Lorraine 


France 


Besancon 


FrancheComte 


France 


Perpignan 


LanguedocRoussillon 


France 


Orleans 


Centre 


France 


Rouen 


HauteNormandie 


France 


Mulhouse 


Alsace 


France 


Caen 


BasseNormandie 


France 


BoulogneBillancourt 


IleDeFrance 


France 


Nancy 


Lorraine 


France 


Montreuil 


IleDeFrance 


France 


Argenteuil 


IleDeFrance 


France j 



As part of the geographic database, there is a functional called GeoDistance that we will use to define a function that tells us how many 
kilometers separate any two cities. 

City-Distance [city 1_, city2_] : = 
GeoDistance [CityData[cityl, "Coordinates"], CityData[city2, "Coordinates"]] / 1000 

Next, we generate the weighted graph. 

g = Map[{First[n[ [1] ] ] ^ First [tt[ [2] ]] , CityDistance @@ tt} &, Tuples [citylist, 2]] // 
Select [R, Last[n] > &] &; 

In France there are large cities and drawing the Kjg produces a largely uninteresting figure. By drawing only the edges for cities within 300 
km of one another, the figure is a Uttle more interesting. The cluster in the north central part of France is greater-Paris, where several other 
adjacent cities are also large. We use the longitude and latitude of each city to plot a location on the plane. 
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Graph[g // Select[tt, Last[S] < 300 &] & // B[ [All, 1] ] &, 
VertexCoordinates ^ Map [ (First [B] -» Reverse [CityData [S, "Coordinates"]]) &, citylist] , 
EdgeStyle -> {Darker, Magenta}, VertexStyle -» Blue] 




Now we use our Kruskal function to generate a spanning tree. Notice the first few edges in the list are of very close cites. 
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span = Kruskal [g] 



Villeurbanne -> Lyon 


4.04522 ^ 


Montreuil -> Paris 


6.6039 


BoulogneBillancourt -» Paris 


8.06249 


Argenteuil -> Paris 


11.5285 


Marseille AixEnProvence 


25.091 


MontpeUier -» Nimes 


46.3561 


Nancy Metz 


47.825 


Marseille Toulon 


48.9574 


LeHavre Caen 


49.043 


Lyon SaintEtienne 


50.2382 


Rouen -» LeHavre 


69.9004 


Besancon -> Dijon 


75.559 


Tours LeMans 


78.1345 


LeMans -» Angers 


80.1425 


Angers -> Nantes 


82.6313 


Villeurbanne Grenoble 


92.0243 


Nimes -» AixEnProvence 


94.3932 


Mulhouse -> Strasbourg 


96.3815 


Rennes -> Nantes 


98.1899 


Amiens Lille 


98.9384 


Argenteuil -» Rouen 


101.2 


Rouen -» Amiens 


101.847 


Orleans BoulogneBillancourt 


106.438 


Tours -> Orleans 


107.746 


SaintEtienne -> ClermontFerrand 


109.348 


Besancon Mulhouse 


115.034 


Strasbourg -> Nancy 


117.825 


Reims Montreuil 


124.727 


Toulon -> Nice 


126.35 


MontpeUier -> Perpignan 


128.738 


ClermontFerrand -> Limoges 


142.36 


Toulouse -» Perpignan 


155.437 


Metz Reims 


157.396 


Villeurbanne Dijon 


173.794 


SaintEtienne -> Nimes 


176.717 


Bordeaux -> Limoges 


180.776 


Brest -> Rennes 


211.714 ; 



Here are a couple of graphs of the spanning tree. The first shows city names, but would need to be blown up to see them all. 
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GraphPlot [span, EdgeLabeling -> False, VertexLabeling -» True, 
VertexCoordinateRules -» Map [ (First [n] -» Reverse [CityData [tt, "Coordinates"]]) &, citylist] ] 







Lille 








Amiens 








^ ' RoLieii r, 

Caen Reims 

11 m OPntPIIM 








Metz 


Brest 


DUl '"C 

Strasbourg 




" Rennes 


LeMans Orleans 


Mulhouse 




Nantes 


ngers Tours Dijon 


sancoii 









Limoges srmontFeirand le 

^ ^ Grenoble 

Bordeaux j 

Toulouse Mor ,d Nice 

Toulon 

Perpignan 

The same graph with VertexLabeling shut off shows the tree's structure. 




Kruskal's algorithm has been implemented in Sage. Here we illustrate how the spanning tree for the graph in Example 10.3.3 can be generated. 
We start with graph data consisting of a list of triples of the form (vertex, vertex, label). The weighted method tells Sage to consider the 
labels as weights. 



edges=[(l, 2, 4), (2, 8, 4), (3, 8, 4), (4, 7, 5), (6, 8, 5), (1, 3, 6), (1, 7, 6), (4, 5, 6), 
(5, 10, 6), (2, 10, 7), (4, 6, 7), (2, 4, 8), (1, 8, 9), (1, 9, 9), (5, 6, 9), (1, 10, 10), (2, 
9, 10), (4, 9, 10), (5, 9, 10), (6, 9, 10)] 
G=Graph ( edges ) 
G. weighted ( True ) 

G. graphplot ( edge_labels=True, save_pos=True ) .show( ) 
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Next, we load the kiuskal function and use it to generate the hst of edges in a spanning tree of G. 



from sage . graphs . spanning_tree import kruskal 
E = kruskal(G, check=True) ;E 

[(1, 2, 4), (2, 8, 4), (3, 8, 4), (4, 7, 5), (6, 8, 5), (1, 7, 6), (4, 5, 6), (5, 10, 6), (1, 9, 9)] 

To see the resulting tree with the same embedding as G, we generate a graph from the spanning tree edges. Next, we set the positions of the 
vertices to be the same as in the graph. Finally, we plot the tree. 



T=Graph(E) 

T . set_pos ( G . get_pos ( ) ) 

T. graphplot ( edge_labels=True) .show( ) 
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EXERCISES FOR SECTION 10.3 
A Exercises 

1 . Suppose that an undirected tree has diameter d and that you would like to select a vertex of the tree as a root so that the resulting rooted tree 
has the smallest depth possible. How would such a root be selected and what would be the depth of the tree (in terms of d)l 

2. Use Kruskal's algorithm to find a minimal spanning tree for the following graphs. In addition to the spanning tree, find the final rooted tree 
in the algorithm. When you merge two trees in the algorithm, make the root with the lower number the root of the new tree. 

(a) 

E 

E 

H 




(b) 




Exercises 

3. Suppose that information on buildings is arranged in records with five fields: the name of the building, its location, its owner, its height, and 
its floor space. The location and owner fields are records that include all of the information that you would expect, such as street, city, and state, 
together with the owner's name (first, middle, last) in the owner field. Draw a rooted tree to describe this type of record, 

4. (Requires Mathematica) Given a country with n large cities evenly distributed in an area of A square kilometers, one would expect that the 
total length of the spanning tree for the cities would roughly depend on n and A. In the case of France, n = 31 cities, A = 547030 km- , and the 
length of the spanning tree is 3501.59 km. Look for such a relationship by collecting similar data for other countries or generating random data. 
To get areas of countries you can use the CountryData function. 

CountryData [ "France " , " Area " ] 

547030. 



Caution: 
model. 



Oddly shaped countries such as Chile, or countries that have large uninhabited areas such as Brazil will probably not fit any proposed 



I 10.4 Binary Trees 



An ordered rooted tree is a rooted tree whose subtrees are put into a definite order and are, themselves, ordered rooted trees. An empty tree and 
a single vertex with no descendants (no subtrees) are ordered rooted trees. 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-StiareAlilce Works 3.0 United States License. 



Chapter 10 - Trees 

Example 10.4.1. The trees in Figure 10.4.1 are identical rooted trees, with root 1 , but as ordered trees, they are different. 




Figure 10.4.1 

Two different ordered rooted trees 

If a tree rooted at v has p subtrees, we would refer to them as the first, second, subtrees. If we restrict the number of subtrees of each 

vertex to be less than or equal to two, we have a binary (ordered) tree. 

Definition: Binary Tree. A binary tree is 

(a) a tree consisting of no vertices (the empty tree), or 

(b) a vertex with two subtrees that are both binary trees. The subtrees are called the left and right subtrees. 

The difference between binary trees and ordered trees is that every vertex of a binary tree has exactly two subtrees (one or both of which may 
be empty), while a vertex of an ordered tree may have any number of subtrees. The two trees in Figure 10.4.2 would be considered identical as 
ordered trees; however, they are different binary trees. Tree (a) has an empty right subtree and Tree (b) has an empty left subtree. 

(a) (b) 





Terminology and General Facts: 

(a) A vertex of a binary tree with two empty subtrees is called a leaf. All other vertices are called internal vertices. 

(b) The number of leaves in a binary tree can vary from one up to roughly half the number of vertices in the tree (see Exercise 4 of this 
section). 

(c) The maximum number of vertices at level /; of a binary tree is 2* ,k >Q (see Exercise 6 of this section) . 

(d) A full binary tree is a tree in which each vertex has either zero or two empty subtrees. In other words, each vertex has either two or zero 
children. See Exercise 7 of this section for a general fact about full binary trees. 

TRAVERSALS OF BINARY TREES 

The traversal of a binary tree consists of visiting each vertex of the tree in some prescribed order. Unlike graph traversals, the consecutive 
vertices that are visited are not always connected with an edge. The most common binary tree traversals are differentiated by the order in which 
the root and its subtrees are visited. The three traversals are best described recursively and are: 

(1) Preorder Traversal: 

(a) Visit the root of the tree. 

(b) Preorder traverse the left subtree. 

(c) Preorder traverse the right subtree. 

(2) Inorder Traversal: 

(a) Inorder traverse the left subtree. 

(b) Visit the root of the tree. 

(c) Inorder traverse the right subtree. 

(3) Postorder Traversal: 
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(a) Postorder traverse the left subtree. 

(b) Postorder traverse the right subtree. 

(c) Visit the root of the tree. 

Any traversal of an empty tree consists of doing nothing. 

Example 10.4.2. For the tree in Figure 10.4.3, the orders in which the vertices are visited are: 
A-B-D-E-C-F-G, for the preorder traversal 
D-B-E-A-F-C-G, for the inorder traversal and 
D-E-B-F-G-C-A, for the preorder traversal. 




Figure 10.4.3 



Example 10.4.3: Binary Tree Sort. Given a collection of integers (or other objects than can be ordered), one technique for sorting is a 
binary tree sort. If the integers are aj , 02, . . . , fln, « > 1 , we first execute the following algorithm that creates a binary tree: 

(1) Insert a I into the root of the tree . 

(2) For k := 2 to n //insert Oi, into the tree 

(2.1) r = ai , 

(2.2) inserted = false 

(2.3) While Not(inserted) Do 
If a^. < r then 

if r has a left child 

then r = left child ofr 

else make a/, the left child ofr: inserted = true 
else // af. > r 

if r has a right child 

thenr = right child of r 

else make a/, the right child ofr: inserted = true 

If the integers to be sorted are 25, 17, 9, 20, 33, 13, and 30, then the tree that is created is the one in Figure 10.4.4. The inorder traversal of this 
tree is 9, 13, 17, 20, 25, 30, 33, the integers in ascending order. In general, the inorder traversal of the tree that is constructed in the algorithm 
above will produce a sorted list. The preorder and postorder traversals of the tree have no meaning here. 
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13 



Figure 10.4.4 

EXPRESSION TREES 

A convenient way to represent an algebraic expression is by its expression tree. Consider the expression 
X = a*b - c/d + e. 

Since it is customary to put a precedence on multiplication/divisions, X is evaluated as ((a*h) - (cjd)) + e. Consecutive multiplication/divi- 
sions or addition/subtractions are evaluated from left to right. We can analyze X further by noting that it is the sum of two simpler expressions 
(a*6) - (cjd) and e?. The first of these expressions can be broken down further into the difference of the expressions a*b and cjd. When we 
decompose any expression into 

(left expression) (operation) (right expression), 

the expression tree of that expression is the binary tree whose root contains the operation and whose left and right subtrees are the trees of the 
left and right expressions, respectively. Additionally, a simple variable or a number has an expression tree that is a single vertex containing the 
variable or number. The evolution of the expression tree for expression X appears in Figure 10.4.5. 

s 
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Example 10.4.4. 

(a) If we intend to apply the addition and subtraction operations in X first, we would parenthesize the expression to a*(b - c)/{d + e). Its 
expression tree appears in Figure 10.4.6a. 

(b) The expression trees for - and for (a + b)*(a - b) appear in Figures 10.4.6(b) and 10.4.6(c). 



(a) 




a / 




be d e 



(b) (c) 




/ \ / \ ^/ V. / \ 

aa bbab ab 



Figure 10.4.6 

The three traversals of an operation tree are all significant. A binary operation applied to a pair of numbers can be written in three ways. One is 
the familiar infix form, such s& a + b for the sum of a and b. Another form is prefix, in which the same sum is written +a b. The final form is 
postfix, in which the sum is written ab +. Algebraic expressions involving the four standard arithmetic operations (+, -, *, and /) in prefix 
and postfix form are defined as follows: 

Prefix: (a) A variable or number is a prefix expression, (b) Any operation followed by a pair of prefix expressions is a prefix expression. 
Postfix: (a) A variable or number is a postfix expression, (b) Any pair of postfix expressions followed by an operation is a postfix expression. 
The connection between traversals of an expression tree and these forms is simple: 

(a) The preorder traversal of an expression tree will result in the prefix form of the expression. 

(b) The postorder traversal of an expression tree will result in the postfix form of the expression. 

(c) The inorder traversal of an operation tree will not, in general, yield the proper infix form of the expression. If an expression requires 
parentheses in infix form, an inorder traversal of its expression tree has the effect of removing the parentheses. 

Example 10.4.5. The preorder traversal of the tree in Figure 10.4.5 is + - *ab/ c de, which is the prefix version of expression X. The postfix 
traversal is ab*c d I -e +. Note that since the original form of X needed no parentheses, the inorder traversal, a *b - cjd + e, is the correct 
infix version. 

COUNTING BINARY TREES 

We close this section with a formula for the number of different binary trees with n vertices. The formula is derived using generating functions. 
Although the complete details are beyond the scope of this text, we will supply an overview of the derivation in order to illustrate how generat- 
ing functions are used in advanced combinatorics. 

Let B (n) be the number of different binary trees of size n (n vertices), « > 0. By our definition of a binary tree, 5 (0) = 1. Now consider any 
positive integer n + 1, « > 0. A binary tree of size n + I has two subtrees, the sizes of which add up to «. The possibilities can be broken 
down into n + I cases: 
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Case 0: Left subtree has size 0; right subtree has size n. 
Case 1: Left subtree has size 1; right subtree has size n - 1. 

Case k: Left subtree has size k; right subtree has size n - k. 

Case n: Left subtree has size n; right subtree has size 0. 

In the general Case k, we can count the number of possibilities by multiplying the number of ways that the left subtree can be filled, B (k), by 
the number of ways that the right subtree can be filled. B(n - k). Since the sum of these products equals B{n + 1), we obtain a recurrence 
relation for « > 0: 

Bin + \) = B{0) Bin) + B{\) Bin - \) + ■■■ + B(n)B(0) 
= ZBik)Bin-k) 

Now take the generating function of both sides of this recurrence relation: 
+ 1)2"= i; \j:Bik)Bin-k) 

or 

G(BT; z) = G(B*B; z) = G(B; z)^. 
Recall that G(B t; z) = "^'^"'f^'"' = ^^^^ If we abbreviate G (B; z) to G, we get 

— = G- zG^ - G + I = 
Using the quadratic equation we get two solutions: 

„ l+V 1-4; , „ 1-V 1-4; 

G| = and G? = 

2z 2c 

The gap in our deviation occurs here since we don't presume calculus. If we expand Gi as an extended series, we find 



G, = ^^^j^ = ^ - I - z-2z^ - 5z' - I4z' -42z' + - 

The coefficients after the first one are all negative and there is singularity at because of the - term. However if we do the same with G? we 
get 



G2= ' = l+z + 2z^ + 5z^ + Uz*+42z^ + ■■■ 

Further analysis leads to a closed form expression for B (n), which is 
1 I2n 

Bin) = 

"+i V n 

This sequence of numbers is often called the Catalan numbers. For more information on the Catalan numbers, see the entry A000108 in The 
On-Line Encyclopedia of Integer Sequences. 

Mathematica Note 

It may be of interest to note how the extended power series expansions of Gi and G2 are determined using Mathematica. The function Series 
will generate a finite number of terms. For example for Gi we evaluate the expression 

,. 1 + Vl - 4 z , 
Series 1^ , {z, 0, 5} J 

I 

- -1 -z-2z^ -5z^ - Uz" -42z^ + 0{z'') 

z 

The first argument is the closed form expression for Gi . The second argument has three parts. The first part is the variable. The second part is 
the center of the expansion, 0, since we want a sum in powers of z to get the coefficients of those powers. The third part is the maximum power 
of the variable. Notice that the output also include the term Oiz^) to indicate that the remaining terms are all multiples of z^. If we wanted the 
first three terms of the expansion around - 1 , which is a sum of powers of z+ 1 , we would get 
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. 1 + Vl - 4 z , 
Seriesj^ , {z, -1, 3}J 

1 ^ff] f 1 3 ^ f 1 13 ^ ( I 61 

2 2 



+ 1--- + + \iz + yf + 



2 2V5~J I 2 ioV5~J V 2 5oV5~J 



+ if + 0((z + if) 



pTi^ Sage Note 

In Sage, one has the capability of being very specific about how algebraic expressions should be interpreted. This also makes working with 
various algebraic expressions a bit more confusing to the beginner. Here, without getting into a lot of detail, is how to get a Laurent expansion 
for Gi above. 

sage: R.<z>=PowerSeriesRing(ZZ, ' z ' ) 
sage: G=( 1+sqrt ( l-4*z ) ) / (2*z ) 
sage: G 

z"-l - 1 - z - 2*z"2 - 5*z'3 - 14*z"4 - 42*z'~5 - 132*z'~6 - 429*z'~7 - 1430*z'~8 - 4862*z"9 - 
16796*z"10 - 58786*z"ll - 208012*z'12 - 742900*z"13 - 2674440*z'14 - 9694845*z"15 - 
35357670*z'16 - 129644790*z'17 - 477638700*z " 18 + 0(z"19) 

The first output above declares a structure called a ring that contains power series. Here we are not using that structure, just a specific element, 
G. So the important thing about this first input is that it establishes z as being a variable associated with power series. When the second 
expression defines the value of G in terms of z, it is automatically converted to a power series. Notice that the final output does match the 
Mathematica result, but with more terms displayed. In Chapter 16 we will introduce rings and will be able to take advantage of Sage's 
capabilities in this area and this will probably make more sense. 

EXERCISES FOR SECTION 10.4 
A Exercises 

1 . Draw the expression trees for the following expressions: 

(a) a{b + c) 

(b) ab + c 

(c) ab + ac 

(d) bb - Aac 

(e) ((03 X + a2) X +ai)x + ao 

2. Draw the expression trees for 

(a) — 

(b) xy + xz + yz 

3. Write out the preorder, inorder, and postorder traversals of the trees in Exercise 1 above. 

4. Verify the formula for B(^:), <k < 3 by drawing all binary trees with three or fewer vertices. 

5 . (a) Draw a binary tree with seven vertices and only one leaf. 

(b) Draw a binary tree with seven vertices and as many leaves as possible. 

B Exercises 

6. Prove that the maximum number of vertices at level k oi & binary tree is 2*' and that a tree with that many vertices at level k must have at 
least 2*+' - 1 vertices. 

7. Prove that if T is a full binary tree, then the number of leaves of T is one more than the number of internal vertices (non-leaves). 

8. Use Mathematica to determine the sequence whose generating function is Giz) = jY~~j3" 
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SUPPLEMENTARY EXERCISES FOR CHAPTER 10 

Section 10.1 

1 . Show that in a tree with n vertices, the sum of the degrees of all vertices is 2 n - 2. 

2. (a) Show that there exists a tree with ten vertices and the property that each vertex has degree either 1 or 5. 
(b) Prove that no such tree exists with an odd number of vertices. 

3. Given G = (V, E) with | V| = v and \E\ = e,G is graceful if the elements in V can be labeled with v distinct positive integers such that for each 
positive integer k, I <k < e, there is an edge connecting vertices ; and j such that 1; - j\ = k. 

(a) Which of the following graphs are graceful? 



V2 









G, 








"2 




"3 







G, 



G4 



(b) Prove that every chain (see Figure 9.6.1) is graceful. It has been conjectured that every tree is graceful. 

Section 10.2 

4. Let G be the graph 




Find a minimal spanning tree for G using the method of left and right sets. Start with R = [a] and L = {b, c, d, e\. At each step show what L 
and R are and indicate what edge you added to the tree. 

Section 10.3 

5. Find a minimal spanning tree for the graph below. Use Kruskal's algorithm and draw the forest of rooted trees after you have added the third 
edge to the spanning set. 
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6. If a rooted tree has the properties that each vertex has no more than m children and its depth is less than or equal to n, how many vertices 
could it have? 

Section 10.4 

7. Represent the algebraic expression (((a - h)*c) + 7)*i(d + 4)/x) by a tree. Determine the depth of the tree. 

8. (a) Write out the post order traversal for the graph below. 

(b) Write out the inorder traversal for the graph below. 

(c) Write out the preorder traversal for the graph below. 




D E F 




9. (a) Draw the operation tree for the expression + 2ab + b, where all multiplications are done first, = xx, and additions are done from left 
to right (as usual). 

(b) List the postorder traversal of the tree that you obtained in part a. What is the significance of this traversal? 

10. (a) Draw the binary tree that would be constructed by the binary sort algorithm for sorting the integers 5, 55, 34, 38, 11, 3, 71, 23, and 18 
in descending order. Afterwards, list the preorder traversal of the tree and then build a sorting tree from that order. Repeat the process above, 
but do an inorder traversal instead of preorder. 

(b) Based on the results of part a, what can you say about the process of building a binary sorting tree, storing it in preorder (or inorder), and 
then building a tree from that new list? 
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chapter 1 1 




ALGEBRAIC SYSTEMS 



GOALS 

The primary goal of this chapter is to make the reader aware of what an algebraic system is and how algebraic systems can be studied at 
different levels of abstraction. After describing the concrete, axiomatic, and universal levels, we will introduce one of the most important 
algebraic systems at the axiomatic level, the group. In this chapter, group theory will be a vehicle for introducing the universal concepts of 
isomorphism, direct product, subsystem, and generating set. These concepts can be applied to all algebraic systems. The simplicity of group 
theory will help the reader obtain a good intuitive understanding of these concepts. In Chapter 15, we will introduce some additional concepts 
and applications of group theory. We will close the chapter with a discussion of how some computer hardware and software systems use the 
concept of an algebraic system. 
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1 11.1 Operations 

One of the first mathematical skills that we all learn is how to add a pair of positive integers. A young child soon recognizes that something is 
wrong if a sum has two values, particularly if his or her sum is different from the teacher's. In addition, it is unlikely that a child would consider 
assigning a non-positive value to the sum of two positive integers. In other words, at an early age we probably know that the sum of two 
positive integers is unique and belongs to the set of positive integers. This is what characterizes all binary operations on a set. 

Definition: Binary Operation. Let S he a nonempty set. A binary operation on S is a rule that assigns to each ordered pair of elements of 
S a unique element of S. In other words, a binary operation is a function from S xS into S. 

Example 11.1.1. Union and intersection are both binary operations on the power set of any universe. Addition and multiplication are 
binary operators on the natural numbers. Addition and multiplication are binary operations on the set of 2 by 2 real matrices, 7^/2x2(1^). 
Division is a binary operation on some sets of numbers, such as the positive reals. But on the integers (1 / 2 f Z) and even on the real numbers 
(1 /O is not defined), division is not a binary operation. 

Notes: 

(a) We stress that the image of each ordered pair must be in S. This requirement disqualifies subtraction on the natural numbers from 
consideration as a binary operation, since 1 - 2 is not a natural number. Subtraction is a binary operation on the integers. 

(b) On Notation. Despite the fact that a binary operation is a function, symbols, not letters, are used to name them. The most commonly used 
symbol for a binary operation is an asterisk, *. We will also use a diamond, o, when a second symbol is needed. 

(c) If * is a binary operation on S and a, b & S, there are three common ways of denoting the image of the pair (a, b). They are: 

*ab a*b ab * 

Prefix Form Infix Form Postfix FOrm 

We are all familiar with infix form. For example, 2 + 3 is how everyone is taught to write the sum of 2 and 3. But notice how 2 + 3 was just 
described in the previous sentence! The word sum preceded 2 and 3. Orally, prefix form is quite natural to us. The prefix and postfix forms 
are superior to infix form in some respects. In Chapter 10, we saw that algebraic expressions with more than one operation didn't need 
parentheses if they were in prefix or postfix form. However, due to our fanuliarity with infix form, we will use it throughout most of the 
remainder of this book. 

Some operations, such as negation of numbers and complementation of sets, are not binary, but tmary operators. 

Definition: Unary Operation. Let S be a nonempty set. A unary operator on S is a rule that assigns to each element of S a unique 

element ofS.In other words, a unary operator is a function from S into S. 

COMMON PROPERTIES OF OPERATIONS 

Whenever an operation on a set is encountered, there are several properties that should immediately come to mind. To effectively make use of 
an operation, you should know which of these properties it has. By now, you should be familiar with most of these properties. We will list the 
most common ones here to refresh your memory and define them for the first time in a general setting. Let S be any set and * a binary operation 
onS. 

Properties that apply to a single binary operation: 

Let * be a binary operation on a set S 

* is commutative if a * b = b * a for all a, b e S. 

* is associative if (a * b) * c = a * (b * c) for all a, b, c e S. 

* has an identity if there exists an element, e, in S such that a * e = e * a = a for all a eS. 

* has the inverse property if for each a e S, there exists b e S such that a*b = b*a = e. 
We call b an inverse of a. 

* is idempotent if a * a = a for all a eS. Properties that apply to two binary operations: 
Let obe a second binary operation on S. 

o is left distributive over * if a o (b * c) = (a o b) * (a o c)for all a,b,c e S. 
o is right distributive over * if(b * c)oa = (b oa) *{c o a) for all a,b,c e S. 
o is distributive over *ifo is both left and right distributive over *. 
Ijet - bea unary operation. 

A unary operation — on S has the involution property if -(-a) = a for all a e S. 
Finally, a property of sets, as they relate to operations. 

If T is a subset ofS, we say that T is closed under * if a, b e T implies that a * b e T. In other words, by operating on elements of 
Twith *, you can't obtain new elements that are outside ofT. 
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Example 11.12. 

(a) The odd integers are closed under multiplication, but not under addition. 

(b) Let p be a proposition over U and let A be the set of propositions over U that imply p. That is; q e Aif^=>p. Then A is closed under 
both conjunction and disjunction. 

(c) The set positive integers that are multiples of 5 is closed under both addition and multiplication. 
Note: It is important to realize that the properties listed above depend on both the set and the operation(s). 

OPERATION TABLES 

If the set on which an operation is defined is small, a table is often a good way of describing the operation. For example, we might want to 
define e on {0, 1, 2) by 

f a + b ifa + b<3 
(a + b-S ifa + b>3 

The table for © is 



ffi 





1 


2 








1 


2 


1 


1 


2 





2 


2 





1 



The top row and left column are the column and row headings, respectively. To determine a®b, find the entry in Row a and Column b. The 
following operation table serves to define * on j, k). 





i 


j 


k 


i 


i 


i 


i 


j 


j 


j 


j 


k 


k 


k 


k 



Note that; j*k = j, yet k * j = k. Thus, * is not commutative. Commutivity is easy to identify in a table: the table must be symmetric with 
respect to the diagonal going from the top left to lower right. 

EXERCISES FOR SECTION 11.1 
A Exercises 

1 . Determine the properties that the following operations have on the positive integers. 

(a) addition 

(b) multipUcation 

(c) M defined by aM b = larger of a and b 

(d) mdefinedby ami = smaller of a and i 

(e) @ defined by a @ = a* 

2. Which pairs of operations in Exercise 1 are distributive over one another? 

3. Let * be an operation on a set S and A, B c 5. Prove that if A and B are both closed under *, then Af\Bis also closed under *, but A [J B 
need not be. 

4. How can you pick out the identity of an operation from its table? 

5. Define a * b by | a - & |, the absolute value of a - b. Which properties does * have on the set of natural numbers, 1^? 
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1 11 .2 Algebraic Systems 

An algebraic system is a mathematical system consisting of a set called the domain and one or more operations on the domain. If V is the 
domain and *i, *2, .... *„ are the operations, [V;*i, *2, .... *„] denotes the mathematical system. If the context is clear, this notation is 
abbreviated to V. 

Example 11^.1. 

(a) Let B* be the set of all finite strings of O's and I's including the nuU (or empty) string, A. An algebraic system is obtained by adding the 
operation of concatenation. The concatenation of two strings is simply the linking of the two strings together in the order indicated. The 
concatenation of strings a with b is denoted aob. For example, "OlIOl" <> "101" = "01101101" and A<> "100" = "100". Note that 
concatenation is an associative operation and that X is the identity for concatenation. 

Note on Notation: There isn't a standard symbol for concatenation. We have chosen <> to be consistant with the notation used in 
Mathematica for the StringJoin function, which does concatenation. Many programming languages use the plus sign for concatenation, 
but others use & or II. 

(b) Let M be any nonempty set and let * be any operation on M that is associative and has in identity in M. Our second example might seem 
strange, but we include it to illustrate a point. The algebraic system [B*; <>] is a special case of [M; *]. Most of us are much more comfort- 
able with B' than with M. No doubt, the reason is that the elements in B' are more concrete. We know what they look like and exactly how 
they are combined. The description of M is so vague that we don't even know what the elements are, much less how they are combined. Why 
would anyone want to study Ml The reason is related to this question: What theorems are of interest in an algebraic system? Answering this 
question is one of our main objectives in this chapter. Certain properties of algebraic systems are called algebraic properties, and any 
theorem that says something about the algebraic properties of a system would be of interest. The ability to identify what is algebraic and what 
isn't is one of the skills that you should learn from this chapter. 

Now, back to the question of why we study M. Our answer is to illustrate the usefulness of M with a theorem about M. 

Theorem 11 2.1. If a, b are elements of M and a *b = b *a,then(a * b) * (a * b) = (a * a) * (b *b). 
Proof: 

(a*b)*(a*b) = a*ib*{a* b)) Why ? 

= a*((b*a)*b) Why? 
= a*{{a*b)*b) Why? 
= a*(a*(b*b)) Why? 
= (a*a)*(h*h) Why? 

The power of this theorem is that it can be appUed to any algebraic system that M describes. Since 6* is one such system, we can apply 
Theorem 11.2.1 to any two strings that commute— for example, 01 and 0101. Although a special case of this theorem could have been proven 
for B*, it would not have been any easier to prove, and it would not have given us any insight into other special cases of M . 

Example 1122. Consider the set of 2x2 real matrices, M2x2(IR)> with the operation of matrix multiplication. In this context. Theorem 11.2.1 
can be interpreted as saying that if AS = B A, then (AB)^ = A^B^. One pair of matrices that this theorem appUes to is [ ^ M and 




LEVELS OF ABSTRACTION 

One of the fundamental tools in mathematics is abstraction. There are three levels of abstraction that we wiU identify for algebraic systems: 
concrete, axiomatic, and universal. 

Concrete LeveL Almost all of the mathematics that you have done in the past was at the concrete level. As a rule, if you can give examples of a 
few typical elements of the domain and describe how the operations act on them, you are describing a concrete algebraic system. Two examples 

of concrete systems are B* and M2x2(IR)- A few others are: 

(a) The integers with addition. Of course, addition isn't the only standard operation that we could include. Technically, if we were to add 
multiplication, we would have a different system. 

(b) The subsets of the natural numbers, with union, intersection, and complementation. 

(c) The complex numbers with addition and multipUcation. 

Axiomatic Level. The next level of abstraction is the axiomatic level. At this level, the elements of the domain are not specified, but certain 
axioms are stated about the number of operations and their properties. The system that we called M is an axiomatic system. Some combinations 
of axioms are so common that a name is given to any algebraic system to which they apply. Any system with the properties of M is called a 
monoid. The study of M would be called monoid theory. The assumptions that we made about M, associativity and the existence of an identity, 
are called the monoid axioms. One of your few brushes with the axiomatic level may have been in your elementary algebra course. Many 
algebra texts identify the properties of the real numbers with addition and multiplication as the field axioms. As we wiU see in Chapter 16, 
"Rings and Fields," the real numbers share these axioms with other concrete systems, all of which are called fields. 

Universal Level. The final level of abstraction is the universal level. There are certain concepts, called universal algebra concepts, that can be 
appUed to the study of aU algebraic systems. Although a purely universal approach to algebra would be much too abstract for our purposes. 
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defining concepts at this level should make it easier to organize the various algebraic theories in your own mind. In this chapter, we will 
consider the concepts of isomorphism, subsystem, and direct product. 

GROUPS 

To illustrate the axiomatic level and the universal concepts, we will consider yet another kind of axiomatic system, the group. In Chapter 5 we 
noted that the simplest equation in matrix algebra that we are often called upon to solve is AX = B, where A and B are known square matrices 
and X is an unknown matrix. To solve this equation, we need the associative, identity, and inverse laws. We call the systems that have these 
properties groups. 

Definition: Group. A group consists of a nonempty set G and an operation * onG satisfying the properties 

(a) * is associative on G: (a *b) *c = a *(b *c) for all a, b, c eG. 

(b) There exists an identity element, e £ G such that a*e = e *a = a for all a e G. 

(c) For all a e G, there exists an inverse, there exist b E G such that a *b = b *a = e. 

A group is usually denoted by its set's name, G, or occasionally by [G; * ] to emphasize the operation. At the concrete level, most sets have a 
standard operation associated with them that will form a group. As we will see below, the integers with addition is a group. Therefore, in group 
theory Z always stands for [Z; +]. 

Generic Symbols. At the axiomatic and universal levels, there are often symbols that have a special meaning attached to them. In group theory, 
the letter e is used to denote the identity element of whatever group is being discussed. A little later, we will prove that the inverse of a group 
element, a, is unique and it is inverse is usually denoted a"' and is read "a inverse." When a concrete group is discussed, these symbols are 
dropped in favor of concrete symbols. These concrete symbols may or may not be similar to the generic symbols. For example, the identity 
element of the group of integers is 0, and the inverse of n is denoted by -n, the additive inverse of n. 

The asterisk could also be considered a generic symbol since it is used to denote operations on the axiomatic level. 
Example 11.2.3. 

(a) The integers with addition is a group. We know that addition is associative. Zero is the identity for addition: + « = n + = « for all 
integers n. The additive inverse of any integer is obtained by negating it. Thus the inverse of nis -n. 

(b) The integers with multiplication is not a group. Although multiplication is associative and 1 is the identity for multiplication, not all 
integers have a multiplicative inverse in Z. For example, the multiplicative inverse of 10 is — , but — is not an integer. 

(c) The power set of any set U with the operation of symmetric difference, ©, is a group. If A and B are sets, then 
A(B B = (A (j B) - (A f] B) . We will leave it to the reader to prove that © is associative over 'P(U). The identity of the group is the empty set: 
Affi = A. Every set is its own inverse since A ® A = 0. Note that f (f/) is not a group with union or intersection. 

Definition: Abelian Group. A group is abelian if its operation is commutative . 

Most of the groups that we will discuss in this book will be abelian. The term abelian is used to honor the Norwegian mathematician N. Abel 
(1802-29), who helped develop group theory. 




Norwegian Stamp honoring Abel 



EXERCISES FOR SECTION 11.2 
A Exercises 

1 . Discuss the analogy between the terms generic and concrete for algebraic systems and the terms generic and trade for prescription drugs. 

2. Discuss the connection between groups and monoids. Is every monoid a group? Is every group a monoid? 
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3 . Which of the following are groups? 

(a) B* with concatenation (Example 11.2.1a). 

(b) ^2x3(IR) with matrix addition. 

(c) M2x3(IR) with matrix multiplication. 

(d) The positive real numbers, R*, with multiplication. 

(e) The nonzero real numbers, R', with multipUcation. 

(f) {1, -1} with multiplication. 

(g) The positive integers with the operation M defined by a M ^) = larger of a and 6. 

4. Prove that, ©, defined by A ® B = {A \J B) - (A f] B) is an associative operation on f (f/). 

5. The following problem supplies an example of a non-abelian group. A rook matrix is a matrix that has only O's and I's as entries such that 
each row has exactly one 1 and each column has exactly one 1. The term rook matrix is derived from the fact that each rook matrix represents 
the placement of n rooks on an n xn chessboard such that none of the rooks can attack one another. A rook in chess can move only vertically or 
horizontally , but not diagonally . Let R„ be the set of n X n rook matrices . There are six 3 x 3 rook matrices : 





1 





\ 




(0 


1 


> 




(0 





1 \ 


/ = 





1 





Ri = 
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R2 = 


1 
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f 1 
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1 ^ 
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Fi = 
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1 










,0 


1 







.1 





, 




,0 





1 , 



(a) List the 2 X 2 rook matrices. They form a group, R2, under matrix multipUcation. Write out the multiplication table. Is the group abeUan? 

(b) Write out the multipUcation table for . This is another group. Is it abeUan? 

(c) How many 4x4 rook matrices are there? How many n X n rook matrices are there? 

6. For each of the following sets, identify the standard operation that results in a group. What is the identity of each group? 

(a) The set of all 2 x 2 matrices with real entries and nonzero determinants. 
Qo) The set of 2 x 3 matrices with rational entries. 

B Exercises 

7. Let V = [e, a, b, c). Let * be defined (partially) hy x * x = e for all x e V. Write a complete table for * so that [V; * ] is a group. 
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1 11.3 Some General Properties of Groups 



In this section, we will present some of the most basic theorems of group theory. Keep in mind that each of these theorems tells us something 
about every group. We will illustrate this point at the close of the section. 

Theorem 113.1. The identity of a group is unique. 

One difficulty that students often encounter is how to get started in proving a theorem like this. The difficulty is certainly not in the theorem's 

complexity. Before actually starting the proof, we rephrase the theorem so that the implication it states is clear. 

Theorem 113.1 (Repfirased). If G = [G; *] is a group and e is an identity ofG, then no other element ofG is an identity of G. 

Proof (Indirect): Suppose that / e G, f 4= e, and /is an identity of G. We will show that / = e, a contradiction, which completes the 
proof: 

f = f * e Since e is an identity. 

= e. Since /is an identity. ■ 

Theorem 113 J. The inverse of any element of a group is unique. 

The same problem is encountered here as in the previous theorem. We will leave it to the reader to rephrase this theorem. The proof is also left 
to the reader to write out in detail. Here is a hint: If b and c are both inverses of a, then you can prove that 6 = c. If you have difficulty with 
this proof, note that we have already proven it in a concrete setting in Chapter 5. 

The significance of Theorem 11.3.2 is that we can refer to the inverse of an element without ambiguity. The notation for the inverse of a is 
usually a~' . (note the exception below). 
Example 11.3.1. 

(a) In any group, e"' is the inverse of the identity e, which always is e. 

(b) (a-')"' is the inverse of a ' , which is always equal to a (see Theorem 11 .3.3 below). 

(c) (x*>'*z)"' is the inverse of X * y * z. 

(d) In a concrete group with an operation that is based on addition, the inverse of a is usually written -a. For example, the inverse of ^ - 3 
in the group [Z; +] is written -(k - 3) = 3 - ^. In the group of 2 x 2 matrices over the real numbers vmder matrix addition, the inverse of 



(l _3 ) is written -^^ j, which equals ^ _j j. 



Theorem 1133. If a is an element of group G, then ' = a. 

Theorem 1133 (Rephrased). If a has inverse b and b has inverse c, then a = c. 

Proof: 

a = a * (b * c) because c is the inverse of b 
= (a * b) * c why? 
= e * c why? 

= c. by the identity property of e. ■ 

Theorem 113.4. If a and b are elements of group G, then (a *bY' = b'' *a~' 

Note: This theorem simply gives you a formula for the inverse of a * b. This formula should be familiar. In Chapter 5 we saw that if A 
and B are invertible matrices, then (A By^ = B"' A"' . 

Proof: Letx = *a~'. We will prove that x inverts a * fe. Since we know that the inverse is unique, we wiU have prove the theorem. 

(a * b) * X = (a * b) * (b~^ *a~^) 

= a* (b*(b^^ *a ')) 
= a*((b*b^^)*a^^) 
= a * (e * a'^) 
= a * a~^ 
= e 

Similarly, x * (a * b) = e; therefore, (a *fo)~' =x = b'^ *a~' ■ 

Theorem 1133. Cancellation Laws. If a, b, and c are elements of group G, both a * b = a * c and b * a = c * a imply that b = c. 

Proof: Since a * = a * c, we can operate on both a * b and a * c on the left with a'^ : 
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a ^* (a * b) = a ' * (a * c) 
Applying the associative property to both sides we get 

(a~^ * a) * b = (a~^ * a) * c 

or 

e ^ b = e * c 

and finally 

b = c. 

This completes the proof of the left cancellation law. The right law can be proven in exacfly the same way. ■ 

Theorem 113.6. Linear Equations in a Group. If G is a group and a, b, e G, the equation a * x = b lias a unique solution, 
X = a ' * b. In addition, the equation x * a = b has a unique solution, x = b * a'' . 
Proof: (for a * x = b): 

a*x = b 

= e * b 

= (a* a"') * b 
= a * (a"' * b) 

By the cancellation law, we can conclude that x = a * b. 

If c and d are two solutions of the equation a * x = b, then a*c = b = a*d and, by the cancellation law, c = d. This verifies that a'^ * b 
is the only solution of a * x = b. ■ 

Note: Our proof of Theorem 11 .3.6 was analogous to solving 4x = 9 in the following way: 

4x = 9 = (4.i)9 = 4(i9) 
Therefore, by cancelling 4, 




Exponentiation in a Group 

If a is an element of a group G, then we establish the notation that 

a * a = a^ 
a*a*a = a^ 
etc. 

In addition, we allow negative exponent and define, for example, a"^ = (a^)"' 

Although this should be clear, proving exponentiation properties requires a more precise recursive definition: 

Definition: Exponentiation in a Group. For n > 0, define a" recursively by a^ = e and if n > 0, (f = (f~' *a. Also, if n >I, 
a-" = (a")-' . 

Example 11.3.2. 

(a) In the group of positive real numbers with multiplication, 

5^-5 =(5'-5)-5 = ((5°-5)-5)-5 = ((l-5)-5)-5= 5 •5-5 = 125. 

and 

5-3 =(125)-'=^ 

(b) In a group with addition, we use a different form of notation, reflecting the fact that in addition repeated terms are multiples, not powers. 
For example, in [Z; +], a + a is written as 2a, a + a + a is written as 3a, etc. The inverse of a multiple of a such as 
-{a + a + a + a + a) = -(5 a) is written as (-5) a. 

Although we define, for example, a' = a^ * a, we need to be able to extract the single factor on the left. The following lemma justifies doing 
precisely that. 

Lemma. Let G be a group. IfbeG and n >0, then b"^' = b* b", and hence b * If = If *b. 
Proof (by induction): If n = 0. 
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= b''*b by the definition of exponentiation 

= e*b basis for exponentiation 

= b * e identity property 

= b * h'^ basis for exponentiation 

Now assume the formula of the lemma is true for some n>0, 

= ^(«+i) ^ ^ j,y tijg definition of exponentiation 
= {b*b")*b by the induction hypothesis 
= b-*{b''*b) associativity 
= b*(b"*^) definition of exponentiation ■ 

Based on the definitions for exponentiation above, there are several properties that can be proven. They are all identical to the exponentiation 

properties from elementary algebra. 

Theorem 113.7. Properties of Exponentiation. If a is an element of a group G, and n and m are integers, 

(a) a^" = [a^'J and hence ia")~' = [a'')" 

(b) a"+'" = a"*a'" 

(c) {a"r = a"" 

We win leave the proofs of these properties to the interested reader. All three parts can be done by induction. For example the proof of (b) 

would start by defining the proposition p{m) , m > 0, to be a"*™ = a"*a'" for all n . The basis is p{0) : a"*" = a''*a°. 

Our final theorem is the only one that contains a hypothesis about the group in question. The theorem only applies to finite groups. 

Theorem 113.8. If G is a finite group, \G\ = n, and a is an element of G, then there exists a positive integer m such that cT = e and 
m <n. 

Proof: Consider the Ust a, a^, a"*' . Since there are n + 1 elements of G in this list, there must be some dupUcation. Suppose that 
a'' = a*, with p <q. Let m = q - p. Then 

a'n=a'i-P = (fl*a-f = a«*(a'')-^ = a« *(a«)-' = e 

Furthermore, since l<p<q<n+l, m = q - p < n. ■ 

Consider the concrete group [Z; +]. All of the theorems that we have stated in this section except for the last one say something about Z. 
Among the facts that we conclude from the theorems about Z are: 

Since the inverse of 5 is -5, the inverse of -5 is 5. 
The inverse of -6 + 71 is -(71) + -(-6) = -71 + 6. 
The solution of 12 + ;c = 22is;c = -12 + 22. 
-4(6) + 2(6) = (-4 + 2) (6) = -2(6) = -(2) (6)). 
7(4(3)) = (7 -4) (3) = 28(3) (twenty-eight 3s). 



EXERCISES FOR SECTION 11.3 
A Exercises 

1 . Let [G; * ] be a group and a be an element of G. Define / : G -» Gby f(x) = a * x. 

(a) Prove that/is a bijection. 

(b) On the basis of part a, describe a set of bijections on the set of integers. 

2. Rephrase Theorem 11 .3.2 and write out a clear proof. 

3. Prove by induction on n that if oi , a2> a„ are elements of a group G,n>2, then 

(ai *a2 * ••• *a„)"' = a„"' * ••• *a2"' *ai"'. 
Interpret this result in terms of [Z; +] and [R; *]. 

4. True or false? If a, c are elements of a group G, and a * b = c * a, then b = c. Explain your answer. 

5 . Prove Theorem 1 1 .3 .7 . 
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6. Each of the following facts can be derived by identifying a certain group and then applying one of the theorems of this section to it. For 
each fact, list the group and the theorem that are used. 

(a) ( 5 is the only solution of 3 x = 5 . 

(b) -(-(-18)) = -18. 

(c) If A, B, C are 3 x3 matrices over the real numbers, with A + B = A + CthenB = C. 

(d) There is only one subset of the natural numbers for which S" © A = A for every A cN. 
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1 11.4 Greatest Common Divisors and the Integers Modulo n 

In this section introduce the greatest common divisor operation and will introduce an important family of concrete groups. 
Greatest Common Divisors 

We start with a theorem about integer division that is intuitively clear. We leave the proof as an optional exercise. 

The Division Property for Integers. Ifm, n e Z, n >0, then there exist two unique integers, q (quotient) and r (remainder), such that 
m = nq + randO < r < n. 

Note: The division property says that if m is divided by n, you will obtain a quotient and a remainder, where the remainder is less than n. 
This is a fact that most elementary school students learn when they are introduced to long division. In doing the division problem 1986 97, 

you obtain a quotient of 20 and a remainder of 46. This result could either be written = 20 H or 1986 = 97-20 + 46. The later 

J ^ 97 97 

form is how the division property is normally expressed. 

If r = 0, i. e., a = bq, then all of the following say the same thing 
b divides a 
a is a multiple of b 
bis a factor of a 
bis a divisor of a 

Notation We use the notation b \ a if b divides a. For example 2 | 18 and 9 | 18 , but 4 j- 18 

Caution: Don't confuse the "divides" symbol with the "divided by" symbol. The former is vertical while the later is slanted. Notice that the statement 2 1 18 is 
related to the fact that 18/2 is a whole number. 



Definition: Greatest Common Divisor. Given two integers, a and b, not both zero. The greatest common divisor of a and b is the 
integer g such that g\a, g\b, and 

c I a and c \ b => c \ g 

A little simpler way to think of gcd(a, b) is as the largest positive integer that is a divisor of both a and h. 

For small numbers, a simple way to determine the greatest common divisior is to use factorization. For example if we want the greatest 
common divisor of 660 and 350, you can factor the two integers: 660 = 2^x3 x 5 x 11 and 350 = 2 x 5^x7. Single factors of 2 and 5 are 

the only ones that appear in both factorizations, so the greatest common divisor is 2 x 5 =10. 

Relatively Prime Pairs. Some pairs of integers have no common divisors other than 1. Such pairs are called relatively prime pairs. For 
example, 128 = 2' and 135 = 3^ 5 are relatively prime. Notice that neither 128 nor 135 are primes. In general, a and h need not be prime in 
order to be relatively prime. However, if you start with a prime, like 23, for example, it wMl be relatively prime to everything but its multiples. 
This theorem, which we prove later generalizes this observation: 

Theorem. Ifp is a prime and a is any integer such that pa a then gcd{a, p) = 1 

The Euclidean Algorithm 

As early as Euclid's time it was known that factorization wasn't the best way to compute greatest common divisors. 
The Euclidean Algorithm is based on the following properties of the greatest common divisor 

gcd(a, 0) = a for a + 

gcd(a, b) = gcd(6, r) ifb + O and a = bq + r 

To compute gcd(fl, b), we divide b into a and get a remainder r such that < r < | |. By the property above, gcd(a, b) = gcd(&, r) . We 
repeat the process until we get zero for a remainder. The last nonzero number that is the second entry in our pairs is the greatest common 
divisior. This is inevitable because the second number in each pair is smaller than the previous one. 

Here is the computation to verify that gcd(99, 53) = 1. At each line, the value of a is divided by the value of b. The quotient is placed on the 
next line along with the new value of a, which is the previous b; and the remainder, which is the new value of b. 



q 


a 


b 




99 


53 


1 


53 


46 


1 


46 


7 


6 


7 


4 


1 


4 


3 


1 


3 


1 


3 


1 






If you were allowed to pick two numbers less than 100, which would you pick in order to force Euclid to work hardest? Here's a hint 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Nonconmierical-ShareAlike 3.0 United States License. 



266 



Chapter 11 - Algebraic Systems 



q a b 

34 21 

1 21 13 

1 13 8 

1 8 5 

1 5 3 

1 3 2 

1 2 1 

2 1 

For fixed values of a and h, consider integers of the form ax + by where x and y can be any two integers. For example if a = 36 and b = 27, 

some of these results are tabulated below with x values along the left column and the y values on top. 

Notice any patterns? What is the smallest positive value the you see in this table? How is it connected to 36 and 27 

y 





-6 




-324 


-297 


-2 


-1 





■ 

-189 




3 


4 


f 5 


el 


-6 


-378 


-351 


-270 


-243 


-216 


-162 


-135 


-108 


-81 


-54 




-342 


-315 


-288 


-261 


-234 


-207 


-180 


-153 


-126 


-99 


-72 


-45 


-18 


-4 


-306 


-279 


-252 


-225 


-198 


-171 


-144 


-117 


-90 


-63 


-36 


-9 


18 


^^^H -270 


-243 


-216 


-189 


-162 


-135 


-108 


-81 


-54 


-27 


mm 


27 


54 


-2 


-234 


-207 


-180 


-153 


-126 


-99 


-72 


-45 


-18 


9 


36 


63 


90 


-1 


-198 


-171 


-144 


-117 


-90 


-63 


-36 


-9 


18 


45 


72 


99 


126 





-162 


-135 


-108 


-81 


-54 


-27 





27 


54 


81 


108 


135 


162 


1 


-126 


-99 


-72 


-45 


-18 


9 


36 


63 


90 


117 


144 


171 


198 


2 


-90 


-63 


-36 


-9 


18 


45 


72 


99 


126 


153 


180 


207 


234 


3 


-54 


-27 




27 


54 


81 


108 


135 


162 


189 


216 


243 


270 


4 


-18 


9 


36 


63 


90 


117 


144 


171 


198 


225 


252 


279 


306 


5 


18 


45 


72 


99 


126 


153 


180 


207 


234 


261 


288 


315 


342 




54 


81 


108 


135 


162 


189 


216 


243 


270 


297 


324 


351 


378 



Theorem 11.4.1. If a and b are positive integers, the smallest positive value of ax + by is the greatest common divisor of a and b, 
gcd(a, b). 

Proof: \i g = gcd(a, b), then g \ a and g\b ^ g\(ax + by)ior any x and j, so a x + by can't be less than g. To show that g is exactly the 
least positive value, we show that g can be attained by extending the Euclidean Algorithm. Performing the extended algorithm involves 
building a table of numbers. There are many variations on the way that this table arranged, so if your book has this algorithm it may look 
slightly different. 

The table for gcd(152,53) is below. In the V" column, you will find 152 and 53, and then the successive remainders from division. So each 
number in "r" after the first two is the remainder after dividing the number immediately above it into the next number up. To the left of each 
remainder is the quotient from the division. So in this case the third row of the table tells us that 152 — 53 x 2 + 46. The last nonzero value in 
r is the greatest common divisor. 

The "s" and "t" columns are new. The values of s and t in each row are maintained so that 
152s + 53t is equal to the number in the "r" column. Notice that 

152 = 152 \ 1 + 53x0 

53 = 152 x + 53x1 

46 = 152 X 1 + 53 (-2) 

1 = 152x 15 + 53 (-43) 
= 152 (-53) + 53x 152 
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<3 


IT 


s 


4- 

T. 




ICO 


1 


U 




53 





1 


2 


46 


1 


-2 


1 


7 


-1 


3 


6 


4 


7 


-20 


1 


3 


-8 


23 


1 


1 


15 


-43 


3 





-53 


152 



The next-to-last equation is what we're looking for in the end! The main problem is to identify how to determine these values after the first two 

rows. The first two rows in these columns will always be the same. 

Let's look at the general case of computing gcd(a,b). If the s and t values in rows i - 1 and / - 2 are correct, we have 
I aSi-2 + bti-2 = fi-2 

In addition, we know that 

r,_2 = /•,_! 9, + r, => ri = ri_2-ri_iq, 

If you substitute the expressions for and r,_2 from (A) into this last equation and then collect the a and b terms separately you get 
ri= a(Si-2- qiSi-i) + &(fi-2 - qiti-i) 

or 

Si = Si^2 - qiSi-i and U = f,_2 - qth-i 

Look closely at the equations for r,, j,-, and f,. Their forms are all the same. With a little bit of practice you should be able to compute s and t 
values quickly. 

Modular Arithmetic 

If two numbers, a and b, share the same remainder after dividing by n. we say that they are congruent modulo n, denoted a = b (modn). For 
example, 13 = 38 (mod 5) because 13 = 5-2 + 3 and 38= 5- + 3. 

Modular Arithmetic. If n is a positive integer, we define the operations of addition modulo n {+„) and multiplication modulo n (x„) as 
follows. If a, b G 

a +„ b = the remainder after a + b is divided by n 
a x„b = the remainder after a ■ b is divided by n. 
Notes: 

(a) The result of doing arithmetic modulo n is always an integer between and n - 1, by the Division Property. This observation implies that 
{0, 1 , n - 1) is closed under modulo n arithmetic. 

(b) It is always true that a +„ = (a + fo) (mod n) and a x„ 6 = (a •&)(modn). For example, 4+75 = 2 = 9 (mod 7) and 

4 X7 5 = 6 = 20(mod7). 

(c) We win use the notation Z„ to denote the set {0, 1, 2, . . ., n - 1}. 

Properties of lUlodular Arithmetic on 

Addition modulo n is always commutative and associative; is the identity for +„ and every element of Z„ has an additive inverse. 
Multiplication modulo n is always commutative and associative, and 1 is the identity for x„. 

Theorem 11.42. If a e Z„, a ^0, then the additive inverse of a is n - a. 

Proof: a + (n-a)=n = (mod n) , since n = n • 1 + 0. Therefore, a +„ (n - a) = ■ 
Note: The algebraic properties of +„ and x„ on Z„ are identical to the properties of addition and multiplication on Z. 
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The Group 2„. For each n> I, [Z„; +„] is a group. Henceforth, we will use the abbreviated notation Z„ when referring to this group. Figure 
1 1 .4.1 contains the tables for Zi through Zg. 



1 2 

12 

112 
2 2 1 

Li 

I I 1 I 2 3 4 I 5 

I 

1 2 3 4 5 

1 1 2 3 4 5 

2 2 3 4 5 1 

3 3 4 5 1 2 

4 4 5 1 2 3 

5 5 1 2 3 4 

Figure 11.4.1 
Addition tables for Z„, 1<n<6. 

Example 11.4.1. 

(a) We are all somewhat familiar with Z12 since the hours of the day are counted using this group, except for the fact that 12 is used in place 
of 0. Military time uses the mod 24 system and does begin at 0. If someone started a four-hour trip at hour 21, the time at which she would 
arrive is 21 +24 4 = 1. If a satellite orbits the earth every four hours and starts its first orbit at hour 5, it would end its first orbit at time 
5 +24 4 = 9. Its tenth orbit would end at 5 +24 7 X244 = 9 hours on the clock 

(b) Virtually all computers represent unsigned integers in binary form with a fixed number of digits. A very small computer might reserve 
seven bits to store the value of an integer. There are only 2^ different values that can be stored in seven bits. Since the smallest value is 0, 
represented as 0000000, the maximum value will be 2^ - 1 = 127, represented as 1111111. When a command is given to add two integer 
values, and the two values have a sum of 128 or more, overflow occurs. For example, if we try to add 56 and 95, the sum is an eight-digit 
binary integer 10010111. One common procedure is to retain the seven lowest-ordered digits. The result of adding 56 and 95 would be 
OOlOllltwo = 23 = 56 4- 95 (mod 128). Integer arithmetic with this computer would actually be modulo 128 arithmetic. 

Mathematica Note 

In Mathematica you can get the gcd of two numbers using the function GCD: 
GCD[660, 350] 
10 

A related function, ExtendedGCD, provides the x and y values guaranteed in Theorem 1 1 .4.2. 
ExtendedGCD [1001, 231] 

{77, {1, -4}} 

Most computer languages have a "mod" function that computes the remainder when one integer is divided by another. Mathematica is no 
exception. To determine the remainder upon dividing 1986 by 97 we can evaluate 

Mod[1986, 97] 

46 

A mod 6 addition function can be defined based on Mod with the following input: 

Plus6[a_, b_] :=Mod[a + b, 6] 

There is a free package called AbstractAlgebra that is available at https://sites.google.com/site/eaamhl/eaam. It contains a function that 
will generate the operation tables, also called Cayley Tables, such you see in Figure 1 1 .4.1 . First load the package, as instructed: 

<< AbstractAlgebra' Master" 

We can form a the group Zj using the FormGroupoid function: 




001 

110 



*4 





1 


2 


3 








1 


2 


3 


1 


1 


2 


3 





2 


2 


3 





1 


3 


3 





1 


2 



1 2 3 4 

1 1 2 3 4 



2 2 3 4 1 

3 3 4 1 2 
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G = FormGroupoid [Range [0, 5] , Plus6] 

Groupoid(|0, 1, 2, 3, 4, 5|, -Operation-) 
Then the function called CayleyTable generates the table for the group Ze: 

CayleyTable [G, BodyColored False, 
HeadingsColored -* False, ShowExtraCayleylnformation -» False] 

TheGroup 

y 



X 



* 





1 


2 


3 


4 


5 








1 


2 


3 


4 


5 


1 


1 


2 


3 


4 


5 





2 


2 


3 


4 


5 





1 


3 


3 


4 


5 





1 


2 


4 


4 


5 





1 


2 


3 


5 


5 





1 


2 


3 


4 



Note: The rules BodyColored -» False, HeadingsColored -» False, ShowExtraCayleylnformation False are included in the input 
above for easier black and white readability. They would not be normally included when using CayleyTable. 

It's actually even easier to generate these tables because the family of Z„ ' s is part of the package. Here is the table for Zg: 

CayleyTable [Z [9] , BodyColored -» False, 
HeadingsColored -> False, ShowExtraCayleylnformation -» False] 

Z[9J 
V 



X 



+ 





1 


2 


3 


4 


5 


6 


7 


8 








1 


2 


3 


4 


5 


6 


7 


8 


1 


1 


2 


3 


4 


5 


6 


7 


8 





2 


2 


3 


4 


5 


6 


7 


8 





1 


3 


3 


4 


5 


6 


7 


8 





1 


2 


4 


4 


5 


6 


7 


8 





1 


2 


3 


5 


5 


6 


7 


8 





1 


2 


3 


4 


6 


6 


7 


8 





1 


2 


3 


4 


5 


7 


7 


8 





1 


2 


3 


4 


5 


6 


8 


8 





1 


2 


3 


4 


5 


6 


7 



Sage Note 

In Sage, gcd is the greatest common divisor function. It can be used in two ways. For the gcd of 2343 and 4319 we can evaluate the expres- 
sion gcd( 2343 , 4319 ) . If we are working with a fixed modulus m that has a value established in your Sage session, the expression 
m. gcd( k) to compute the gcd of m and any integer value k. 

Sage has some extremely powerful tool for working with groups. The integers modulo n are represented by the expression Integers ( n) and 
the addition and multiplications tables can be generated as follows. 
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R = Integers (6) 

print R.addition_table( 'elements' ) 
print R.multiplication_table( 'elements' ) 



+ 1 2 3 4 5 
+ 

1 2 3 4 5 

1 1 2 3 4 5 

2 2 3 4 5 1 

3 3 4 5 1 2 

4 4 5 1 2 3 

5 5 1 2 3 4 

* 1 2 3 4 5 
+ 



1 1 2 3 4 5 

2 2 4 2 4 

3 3 3 3 

4 4 2 4 2 

5 5 4 3 2 1 

Once we have assigned R a value of Integers ( 6 ) , we can do calculations by wrapping R( ) around the integers through 5. Here is a list 
containing the mod 6 sum and product, respectively, of 5 and 4: 



[R(5)+R(4), R(5)*R(4)] 

[3, 2] 

EXERCISES FOR SECTION 11.4 
A Exercises 

1 . Determine the greatest common divisors of the following pairs of integers without using any computational assistance. 

(a) 2^ 3^ 5 and 2^ 3 5^ 7 

(b) 2x3x4x5x6x7 and 3x5x7x9x11x13 

(c) 19"* and 19' 

(d) 121 12 and 

2. Find all possible values of the following, assuming that m is a positive integer. 

(a) gcd (m + 1 , m) 

(b) gcd (m + 2, m) 

(c) gcd {m + 4, m) 

3. Calculate: 

(a) 7 +8 3 

(b) 7 X8 3 

(c) 4X84 

(d) 10+12 2 

(e) 6X82 +8 6X85 

(f) 6X8(2 +8 5) 

(g) 3 X5 3 X5 3 Xj 3 = 3'*(mod5) 

(h) 2x„7 

(i) 2xi47 

4. List the additive inverses of the following elements: 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed imder a Creative Commons Attribution-Noncommerical-ShareAlike 3.0 United States License. 



Chapter 11 - Algebraic Systems 



271 



(a) 4,6,9mZio 

(b) 16,25,40mZ3o 

5. In the group Zn , what are: 

(a) 3(4)? 

(b) 36(4)? 

(c) How could you efficiently compute m (4), m e Z? 

6 . Prove that {l,2,3,4}isa group under the operation X5 . 

7. A student is asked to solve the following equations under the requirement that all arithmetic should be done in Z2. List all solutions. 

(a) + 1 = 0. 

(b) + X + I = 0. 

8. Determine the solutions of the same equations as in Exercise 5 in Z5. 

B Exercises 

9. Prove the division property by induction on m. 

10. Prove that congruence modulo n is an equivalence relation on the integers. Describe the set of equivalence classes that congruence modulo 
n defines. 
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1 11.5 Subsystems 

The subsystem is a fundamental concept of algebra at the universal level. 

Definition: Subsystem. If [V; . . ., *„] is an algebraic system of a certain kind and W is a subset of V, then W is a subsystem of V if 
[W; */, . . ., *„] is an algebraic system of the same kind as V. The usual notation for "W is a subsystem ofV" is W < V. 

Since the definition of a subsystem is at the universal level, we can cite examples of the concept of subsystems at both the axiomatic and 
concrete level. 

Example 11^.1 

(a) (Axiomatic) If [G; *] is a group, and H is a subset of G, then His a subgroup of G if [H; *] is a group. 

0?) (Concrete) U = {- 1 , 1} is a subgroup of [R* ; •] . Take the time now to write out the multiplication table of U and convince yourself that 
[U;-] is a group. 

(c) (Concrete) The even integers, 2 Z = [2k : fe is an integer} is a subgroup of [Z; +]. Convince yourself of this fact. 

(d) (Concrete) The set of nonnegative integers is not a subgroup of [Z; +]. All of the group axioms are true for this subset except one: no 
positive integer has a positive additive inverse. Therefore, the inverse property is not true. Note that every group axiom must be true for a 
subset to be a subgroup. 

(e) (Axiomatic) If M is a monoid and P is a subset of M, then P is a submonoid of M if P is a monoid. 

(f) (Concrete) If B* is the set of strings of O's and I's of length zero or more with the operation of concatenation, then two examples of 
submonoids of B* are: (i) the set of strings of even length, and (ii) the set of strings that contain no O's. The set of strings of length less than 50 
is not a submonoid because it isn't closed under concatenation. Why isn't the set of strings of length 50 or more a submonoid of B*? 

For the remainder of this section, we will concentrate on the properties of subgroups. The first order of business is to establish a systematic way 
of determining whether a subset of a group is a subgroup. 

Theorem/Algorithm 11.5.1. To determine whether H, a subset of group [G; *], is a subgroup, it is sufficient to prove: 

(a) H is closed under *; that is, a, h e H => a * b E H; 

(b) H contains the identity element for *; and 

(c) H contains the inverse of each of its elements; that is, a e H => a~' e H. 

Proof: Our proof consists of verifying that if the three properties above are true, then all the axioms of a group are true for [H ; *J. By Condi- 
tion (a), * can be considered an operation on H. The associative, identity, and inverse properties are the axioms that are needed. The identity 
and inverse properties are true by Conditions (b) and (c), respectively, leaving only the associative property. Since, [G; *] is a group, 
a * (h * c) = (a * fo) * c for all a, b, c e G. Certainly, if this equation is true for all choices of three elements from G, it wiU be true for all 
choices of three elements from H, since His a subset of G. ■ 

For every group with at least two elements, there are at least two subgroups: they are the whole group and [e]. Since these two are automatic, 
they are not considered very interesting and are called the improper subgroups of the group; [e} is sometimes referred to as the trivial subgroup. 
All other subgroups, if there are any, are called proper subgroups. 

We can apply Theorem 1 1.5.1 at both the concrete and axiomatic levels. 
Examples 11S2. 

(a) (Concrete) We can verify that 2Z < Z, as stated in Example 11.5.1. Whenever you want to discuss a subset, you must find some 
convenient way of describing its elements . An element of 2 Z can be described as 2 times an integer; that is , a e 2 Z is equivalent to 
(3k);^(a = 2k). Now we can verify that the three conditions of Theorem 11.5.1 are true for 2Z. First, ii a, h e 2Z, then there exist 
j, k e ^ such that a = 2j and b = 2k. A common error is to write something like a = 2j and b = 2j. This would mean that a = b, 
which is not necessarily true. That is why two different variables are needed to describe a and b. Returning to our proof, we can add a and 
b: 

a + b = 2 j + 2k = 2{j + k). 

Since j + k is an integer, a + b is an element of 2Z. Second, the identity, 0, belongs to 2Z (0 = 2(0)). Finally, if a e 2Z and 
a = 2k, -a = -(2k) = 2(-yt),and -keZ, therefore, -a e 2 Z. By Theorem 11.5.1, 2 Z < Z. 

How would this argument change if you were asked to prove that 3 Z < Z? orn Z < Z, n > 2? 

(b) (Concrete) We can prove that H = {0, 3, 6, 9) is a subgroup of Z12 . First, for each ordered pair (a, b) e H x H, a +12 b is in H. 
This can be checked without too much trouble since \H xH\ = 16. Thus we can conclude that H is closed under +12. Second, e if. Third, 
-0 = 0,-3 = 9,-6 = 6, and -9 = 3.Therefore, the inverse of each element in /f isiaH. 

(c) (Axiomatic) If H and K are both subgroups of a group G, then H C\ Kisa subgroup of G. To justify this statement, we have no concrete 
information to work with, only the facts that H < G and K <G. Our proof that H P\ K < G reflects this and is an exercise in applying the 
definitions of intersection and subgroup, (i) If a and b are elements of H H K, then a and b both belong to H, and since H < G,a * b must be 
an element of H. Similarly, a * b e K"; therefore, a * b & H f] K. (ii) The identity of G must belong to both H and K; hence it belongs to 
H n K.(iii)Jta eH fl^^.thena e if, and since if < G,a-^ e if. Similarly, a"' e JsT. Hence, by the theorem, H f] K < G. 
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Now that this fact has been estabhshed, we can apply it to any pair of subgroups of any group. For example, since 2Z and 3 Z are both 
subgroups of [Z; +], 2 Z f] 3 Z is also a subgroup of Z. Note that ifae2Zn3Z, a must have a factor of 3; that is, there exists e Z 
such that a = 3k. In addition, a must be even, therefore k must be even. There exists y 6 Z such that k = 2j, therefore a = 3 (2 y) = 6 j. 
This shows that 2Zn 3Zc 6Z. The opposite containment can easily be established; therefore, 2Z f] 3Z = 6Z. 

Given a finite group, we can apply Theorem 1 1 .3 .7 to obtain a simpler condition for a subset to be a subgroup. 

Theorem/Algorithm 11^2. If [G; * ] is a finite group, H is a nonempty subset ofG, and you can verify that H is closed under * , then H 
is a subgroup ofG. 

Proof: In this proof, we demonstrate that Conditions (b) and (c) of Theorem 11.5.1 follow from the closure of H under * , which is 
Condition (a). First, select any element of H; call it fi. The powers of yS : yS^, ... are all in H by the closure property. By Theorem 

1 1 .3.7, there exists m,m <\G\, such that yS™ = e; hence e e H.To prove that (c) is true, we let a be any element of H. If a = e, then a~^ is in 
H since e'^ = e.K a t e,a'' = e for some q between 2 and |G| and 



e = a 

Therefore, a"' = a''"' 



1 > 1. 1 



, which belongs to H since q 

Example 11.5.3 To determine whether Hi = {0, 5, 10) and H2 = {0, 4 
tables (modulo 15) for these sets. 

Hi H2 

y 

y 



8, 12) are subgroups of Z15 , we need only write out the addition 



+ 





5 


10 








5 


10 


5 


5 


10 





10 


10 





5 








4 


8 


12 








4 


8 


12 


4 


4 


8 


12 


1 


8 


8 


12 


1 


5 


12 


12 


1 


5 


9 



Note that Hi is a subgroup of Z15. Since the interior of the addition table for H2 contains elements that are outside of H2 , H2 is not a subgroup 

0fZi5. 

One kind of subgroup that merits special mention due to its simpUcity is the cycUc subgroup. 

Definition: Cyclic Subgroup Generated by an Element. If G is a group and a e G, the cyclic subgroup generated by a, (a), is the set of 
powers of a and their inverses: 

(a) = {a" .• n e Z") 
A subgroup H is cyclic if there exists a e H such that H = (a). 

Definition: Cyclic Group. A group G is cyclic if there exists J3 eG such that {fi) = G. 
Note: If the operation on G is additive, then (a) = [in) a : n & 1.}. 

Example WSA. 

(a) In[R; •], (2) = {2« : n e Z) = {..., ^, 1, 2, 4, 8, 16, ...}. 

(b) In Zj5, (6) = {0, 3, 6, 9, 12}. If G is finite, you need Ust only the positive powers of a up to the first occurrence of the identity to 
obtain all of (a). In Z^ , the multiples of 6 are 6, (2) 6 = 12, (3) 6 = 3, (4) 6 = 9, and (5) 6 = 0. Note that {0, 3, 6, 9, 12) is also (3), (9), 
and (12). This shows that a cyclic subgroup can have different generators. 

If you want to list the cyclic subgroups of a group, the following theorem can save you some time. 

Theorem 1 1 .5.3. If a is an element of group G, then (a) = (a"'). This is an easy way of seeing that (9) in Z15 equals (6), since -6 = 9. 



EXERCISES FOR SECTION 11.5 
A Exercises 

1 . Which of the following subsets of the real numbers is a subgroup of [R; +]? 

(a) the rational numbers 

(b) the positive real numbers 

(c) {fe/2 I is an integer) 
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(d) {2* I A: is an integer) 

(e) {;c I -100 < ;c < 100} 

2. Describe in simpler terms the following subgroups of Z: 

(a) 5 Z n 4 Z 

(b) 4 Z n 6 Z (be careful) 

(c) the only finite subgroup of Z 

3. Find at least two proper subgroups of R-} , the set of 3 x 3 rook matrices (see Exercise 5 of Section 1 1 .2). 

4 . Where should you place the following in Figure 11.5.1? 

(a) e 

(b) a-i 

(c) X * y 







G 








/a / 

/ • / 


1 y 




\ • \ 







Figure 11.5.1 



5. (a) List the cyclic subgroups of Zg and draw an ordering diagram for 
the relation "is a subset of on these subgroups. 

(b) Do the same for Z12 . 

(c) Do the same for Zg . 

(d) On the basis of your results in parts a, b, and c, what would you expect if you did the same with Z24? 

B Exercises 

6. Subgroups generated by subsets of a group. The concept of a cyclic subgroup is a special case of the concept that we wiU discuss here. Let 
[G; * ] be a group and S a nonempty subset of G. Define the set (S) recursively by: 

(i) lf a eS.thena e (5), 

(ii) If a, b e {S), then a*b e (5), and 

(iii) lfa e(5),thena-i e(S). 

(a) By its definition, (S) has all of the properties needed to be a subgroup of G. The only thing that isn't obvious is that the identity of G is in 
(S). Prove that the identity of G is in (5). 

(b) What is ({9, 15})in[Z; +]? 

(c) Prove that H H < G and S Q H, then (5) < H. This proves that (5) is contained in every subgroup of G that contains S; that is, (S) = f] H . 

ScH 
H<G 

(d) Describe ((0.5, 3)) in [ IR+; •] and in [R; +] . 

(e) If 7, ^ e Z, ({/, ^1) is a cyclic subgroup of Z. In terms of.; and k, what is a generator of ({j, k])l 

7. Prove that HH, K < G,aiidH \J K = G, then H = G or K = G. (Hint: Use an indirect argument.) 
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Our second universal algebraic concept lets us look in the opposite direction from subsystems. Direct products allow us to create larger systems. 
In the following definition, we avoid complicating the notation by not specifying how many operations the systems have. 

Definition: Direct Product. If[Vj;*i, o^, ...], [V2;*2, 02, [Vj; *„, are algebraic systems of the same kind, then the 

direct product of these systems is V = V2 XV2 X ■■■ xV„ , with operations defined below. The elements of V are n-tuples of the form 
(aj, a2, . . . , a„ ), where aj^ e Vj^, k = 1 , . . . , n. The systems Vj, V^. Vn are called the factors ofV. There are as many operations on V 
as there are on the factors. Each of these operations is defined componentwise: 

If(ai, a2, a„), (bi, bi, .... bn)eV, 

(ai, 02, a„)*(bi,b2, b „) ={a,*ib,, a2*2b2, .... a„*„bn) 
(aj, a2, an)o{bj,b2, .... b„) ={a,o,b:, a2^2b2,--- a„o„b„) 

Example 11.6.1. Consider the monoids (the set of natural numbers with addition) and B* (the set of finite strings of O's and I's with 
concatenation). The direct product of fsl with B* is a monoid. We illustrate its operation, which we wiU denote by * , with examples: 

(4, 001) * (3, 11) = (4 + 3, 001 11) = (7, 00111) 
(0, 1 1 010) * (3, 01) = (3, 1 101 001) 

(0, A) * (129, 00011) = (0 + 129, A 00011) = (129, 00011) 

(2, 01) * (8, 10) = (10, 0110), and 

(8, 10) * (2, 01) = (10, 1001). 
Note that our new monoid is not commutative. What is the identity for * ? 
Notes: 

(a) On notation. If two or more consecutive factors in a direct product are identical, it is common to combine them using exponential notation. 
For example, Z x Z x IR can be written 1? x R, and R x IR x IR x R can be written R"*. This is purely a notational convenience; no exponenti- 
ation is really taking place. 

(b) In our definition of a direct product, the operations are called componentwise operations, and they are indeed operations on V. Consider * 
above. If two n-tuples, a and b, are selected from V, the first components of a and b, ai and bi , are operated on with *i to obtain ai*ibi, the 
first component of a * h. Note that since *i is an operation on Vi, ai *ibi is an element of Vi. Similarly, all other components of a * b, as they 
are defined, belong to their proper sets. 

One significant fact about componentwise operations is that the components of the result can all be computed at the same time (concurrently). 
The time required to compute in a direct product can be reduced to a length of time that is not much longer than the maximum amount of time 
needed to compute in the factors (see Figure 1 1 .6.1). 

Time >■ 



fi-bi 




a„-b„ 

Figure 1 1 .6.1 
Concurrent caicuiation in a direct product. 



(c) A direct product of algebraic systems is not always an algebraic system of the same type as its factors. This is due to the fact that certain 
axioms that are true for the factors may not be true for the set of n-tuples. This situation does not occur with groups however. You will find that 
whenever a new type of algebraic system is introduced, call it type T, one of the first theorems that is usually proven, if possible, is that the 
direct product of two or more systems of type T is a system of type T. 

Theorem 11.6.1. The direct product of two or more groups is a group; that is, the algebraic properties of a system obtained by taking the 
direct product of two or more groups includes the group axioms. 

We win only present the proof of this theorem for the direct product of two groups. Some slight revisions can be made to obtain a proof for any 
number of factors. 
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Proof: Stating that the direct product of two groups is a group is a short way of saying that if [Gu *i] and [G2; *2] are groups, then 
[Gi X G2; * ] is also a group, where * is the componentwise operation on Gi x G2. 

Associativity of * : If a, b, c e Gi XG2, 

a * (b * c) = (ai , (12) * iih] , i>2) * (ci , C2)) 

= (ai, a2)*(fcl *lCl, ^)2*2C2) 

= (ai*l(&l*lCl),a2*2(&2*2C2)) 

= ((«! *l''l)*lCl, (a2*2M*2C2) 

= (oi *,hi, a2*2M*(ci, C2) 

= ((fli, fl2)*(^l, &2))*(Cl, C2) 

= (a * ^))*c 

Notice how the associativity property hinges on the associativity in each factor. 

An identity for *: As you might expect, if ei and 62 are identities for Gi and G2, respectively, then e = (ei, 62 ) is the identity for Gi XG2. If 

a e Gi XG2, 

a * e = (ai, 02)* (ei, 62 ) 
= (ai *iei, 02*2 62) 
= (ai, 02) 
= a 

Similarly, e * a = a. 

Inverses in Gi XG2: The inverse of an element is determined componentwise a~' = (ai, 02)"' = (ai~', 02"') . To verify, we compute a * : 

a * = (ai, a2)*(ai"', 02"') 

= (ai*iar', 02*202"') 
= (ei, 62) 
= e 

Similarly, fl"' ■* a = e.M 
Example 11.6.2. 

(a) If n > 2, Z2" , the direct product of n factors of Z2, is a group with 2" elements. We will take a closer look at Z2"' = Z2 x Z2 x Z2. The 
elements of this group are triples of zeros and ones. Since the operation on Z2 is +2, we will use the symbol + for the operation on Z2'' . Two 
of the eight triples in the group are a = (1, 0, 1) and b = (0, 0, 1). Their "sum" is a + = (1 +2 0, +2 0, 1 +2 1) = (1, 0, 0). One 
interesting fact about this group is that each element is its own inverse. For example a + a = (l, 0, 1) + (1, 0, 1) = (0, 0, 0); therefore 
-a = a. We use the additive notation for the inverse of a because we are using a form of addition. Note that {(0, 0, 0), (1, 0, 1)) is a 
subgroup of Z2^. Write out the "addition" table for this set and apply Theorem 11.5.2. The same can be said for any set consisting of (0, 0, 0) 
and another element of Z2^. 

(b) The direct product of the positive real numbers with the integers modulo 4, IR+ x Z4 is an infinite group since one of its factors is infinite. 
The operations on the factors are multipUcation and modular addition, so we wiU select the neutral symbol o for the operation on x Z4. If 
a = (4, 3)and^) = (0.5, 2), then 

aob = (4, 3)o (0.5, 2) = (4 • 0.5, 3+4 2) = (2, 1) 

= bob = (0.5, 2) o (0.5, 2) = (0.25, 0), 
a-i = (4-1 , -3) = (0.25, 1) and 
i-i = (0.5-1 ^ _2) = (2_ 2). 

It would be incorrect to say that Z4 is a subgroup of x Z4 , but there is a subgroup of the direct product that closely resembles Z4. It is 
{(1, 0), (1, 1), (1, 2), (1, 3)}. Its table is 






{1, 0) 


{1, 1) 


{1, 2) 


{1, 3} 


{1, 0} 


{1, 0} 


{1, 1} 


(1, 2) 


{1, 3} 


{1, 1} 


{1, 1} 


{1, 2} 


(1, 3) 


{1, 0} 


{1, 2} 


{1, 2} 


{1, 3} 


{1, 0) 


{1, 1} 


{1, 3} 


{1, 3} 


{1, 0} 


{1, 1} 


{1, 2} 



Imagine erasing (1,) throughout the table and writing +4 in place of «. What would you get? We wiU explore this phenomenon in detail in the 
next section. 
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The whole direct product could be visualized as four parallel half-lines labeled 0,1,2, and 3 (Figure 1 1 .6.2). On the ^th line, the point that lies x 
units to the right of the zero mark would be (x, k). The set 1(2", (n) 1) | n e Z), which is plotted on Figure 11.6.2, is a subgroup of 1R+ x Z4. 
What cyclic subgroup is it? 



12 3 4 
3 • 

2 • • 

1 • • 

• • 

Figure 1 1 .6.2 
Graph of IR* x Z4 



The answer: ((2, l))or(0', 3)). 

A more conventional direct product is R^, the direct product of two factors of [R; + ]. The operation on R^ is componentwise addition; hence 
we will use + as the operation symbol for this group. You should be familiar with this operation, since it is identical to addition of 2 X 1 
matrices. The Cartesian coordinate system can be used to visualize R- geometrically. We plot the pair (s, t) on the plane in the usual way: s 
units along the x axis and t units along the y axis. There is a variety of different subgroups of R^ , a few of which are: 

(1) ((x, 0) I X e R), all of the points on the x axis; 

(2) {(x, j) I 2 X - J = 0), all of the points that are on the line 2x - y = 0; 

(3) \i a, b & R, {(x, y)\ax + by = 0]. The first two subgroups are special cases of this one, which represents any line that passes 
through the origin. 

(4) {(x, y)\2x-y = k, k & Z}, a set of lines that are parallel to 2 x - y = Q. 

(5) {(n, 3ri) \ n & J.], which is the only countable subgroup that we have listed. 

We win leave it to the reader to verify that these sets are subgroups. We will only point out how the fourth example, call it H, is closed under 
"addition." If a = {p, q) and b = (j, t) and both belong to H, then 2p - q = j and 2s — t= k, where both j and k are integers. 

a + b = (p, q) + (s, t) = (p + s, q + t) 

We can determine whether a + b belongs to H by deciding whether or not 2(_p + s) - (_q + t)is m integer: 

2{p + s) - {q + t) = 2 p + 2 s - q - t 

= (2p - q) + (2s - t) 
= j + k 

which is an integer. This completes a proof that H is closed under the operation of R^. 

Several useful facts can be stated in regards to the direct product of two or more groups. We wMl combine them into one theorem, which we 
win present with no proof. Parts a and c were derived f orn = 2 in the proof of Theorem 11.6.1. 

Theorem 11.6^. If G = Gi x G2 x ••• x G„ is a direct product of n groups and (ai, 02 , .... a„) e G, then: 

(a) Theidentity ofGis(ei, 62, .... e„), where e^^, is the identity of G;^. 

(b) (ai, 02 , .... a„)"' = (ar\ 02"' , . .. , a„~^) . 

(c) (au 02, a„)"'= (ar, 02"", fln"*) forallmeZ. 

(d) G is abelian if and only if each of the factors Gi , G2, . . . , G„ is abelian. 

(e) If //] , H2, . . . , H„ are subgroups of the corresponding factors , then H] x H2 x ■■■ x H„ a subgroup of G. 

Not all subgroups of a direct product are obtained as in part e of Theorem 1 1 .6.2. For example, {(n, n) | n e Z) is a subgroup of Z^, but is not 
a direct product of two subgroups of Z. 

Example 11.63. Using the identity (x + y) + x = y, in Z2, we can devise a scheme for representing a symmetrically linked list using only 
one link field. A symmetrically linked list is a list in which each node contains a pointer to its immediate successor and its immediate predeces- 
sor (see Figure 1 1 .6.3). If the pointers are n-digit binary addresses, then each pointer can be taken as an element of Z2" . Lists of this type can be 
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accomplished using cells with only one link. In place of a left and a right pointer, the only "Unk" is the value of the sum (left link) + (right 
link). All standard list operations (merge, insert, delete, traverse, and so on) are possible with this structure, provided that you know the value of 
the nil pointer and the address, /, of the first (i. e., leftmost) cell. Since first / .left is nil, we can recover /'.right by adding the value of nil: 
/ + nil = (nil + /.right) + nil = / .right, which is the address of the second item. Now if we temporarily retain the address, s, of the second 
cell, we can recover the address of the third item. The link field of the second item contains the sum j.left + j.right = first + third. Therefore 

(first + third) + first = s + s.left 

= ( J. left + J. right) + j.left 
= j.right = third 

We no longer need the address of the first cell, only the second and third, to recover the fourth address, and so forth. 



A = Nil= 0000 
1001 1101 







B 



















D 















A 




B 




C 




D 




E 


1101 




1010 




1011 




1011 




0110 



Figure 11.6.3 
Symmetric Linked List 

The following more formal algorithm uses names that the timing of the visits. 

Algorithm 11.6.1. Given a symmetric list represented as in Example 11.6.3, a traversal of the list is accomplished as follows, where first 
is the address of the first cell. We presume that each item has some information that is represented by item. info and afield called item. link 
that is the sum of the left and right links. 

(1) yesterday =nil 

(2) today = first 

(3) While today / nil do 

(3.1) Write( today .info ) 

(3.2) tomorrow - today. link + yesterday 

(3.3) yesterday = today 

(3.4) today = tomorrow. 

At any point in this algorithm it would be quite easy to insert a cell between today and tomorrow. Can you describe how this would be 
accompUshed? 



EXERCISES FOR SECTION 11.6 
A Exercises 

1 . Write out the group table of Z2 x Z3 and find the two proper subgroups of this group. 

2. List more examples of proper subgroups of that are different from the ones in Example 1 1 .6.2. 

3. Algebraic properties of the n-cube: 

(a) The four elements of can be visualized geometrically as the four corners of the 2-cube (see Figure 9.4.5). Algebraically describe 
the statements: 

(i) Corers a and b are adjacent. 

(ii) Comers a and b are diagonally opposite one another. 

(b) The eight elements of Z2' can be visualized as the eight corners of the 3-cube. One face contains Z2 x Z2 x {0) and the opposite face 
contains the remaining four elements so that (a, b, 1) is behind (a, b, 0). As in part a, describe statements i and ii algebraically. 

(c) If you could imagine a geometric figure similar to the square or cube in n dimensions, and its comers were labeled by elements of Z2" 
as in parts a and b, how would statements i and ii be expressed algebraically? 
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4. (a) Suppose that you were to be given a group [G; *] and asked to solve the equation x * x = e. Without knowing the group, can you 
anticipate how many solutions there will be? 

(b) Answer the same question as part a for the equation x * x = x. 

5. Which of the following sets are subgroups of Z x Z? Give a reason for any negative answers. 

(a) {0} 

(b) {(2y, 2k)\j,k& Z) 

(c) {(2y+ 1, 2k) I j,keZ] 

(d) {(n, n 2 ) I n e Z} 

(e) {(J, k) \ j + k is even} 

6. Determine the following values in group Z3 x R*: 

(a) (2, 1)*(1,2) 

(b) the identity element 

(c) (1, 1/2)-' 
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I 1.7 Isomorphisms 



The following informal definition of isomorphic systems should be memorized. No matter how technical a discussion about isomorphic systems 
becomes, keep in mind that this is the essence of the concept. 

Definition: Isomorphic Systems/Isomorphism. Two algebraic systems are isomorphic if there exists a translation rule between them so 
that any true statement in one system can be translated to a true statement in the other 

Example 11.7.1. Imagine that you are an eight-year-old child who has been reared in an English-speaking family, has moved to Greece, 
and has been placed in a Greek school. Suppose that your new teacher asks the class to do the following addition problem that has been 
written out in Greek. 



Tpia crvv TEcrcrEpa LcrovraL 

The natural thing for you to do is to take out your Greek-English/English-Greek dictionary and translate the Greek words to English, as 
outlined in Figure 11.7.1. After you've solved the problem, you can consult the same dictionary to obtain the proper Greek word that the 
teacher wants. Although this is not the recommended method of learning a foreign language, it will surely yield the correct answer to the 
problem. Mathematically, we may say that the system of Greek integers with addition (crvv) is isomorphic to English integers with addition 
(plus). The problem of translation between natural languages is more difficult than this though, because two complete natural languages are 
not isomorphic, or at least the isomorphism between them is not contained in a simple dictionary. 



Tpia 



three 



plus 



TEcrcrspa 



four 



CTTO 

A 



equals 



Figure 11.7.1 
Solution of a Greek arithmetic problem 

Example 11.7.2. Software Implementation of Sets. In this example, we will describe how set variables can be implemented on a 
computer. We will describe the two systems first and then describe the isomorphism between them. 

System 1: The power set of {1, 2, 3, 4, 5} with the operation union, IJ- For simplicity, we will only discuss union. However, the other 
operations are implemented in a similar way. 

System 2: Strings of five bits of computer memory with an OR gate. Individual bit values are either zero or one, so the elements of this 
system can be visualized as sequences of five O's and I's. An OR gate. Figure 1 1 .7.2, is a small piece of computer hardware that accepts two 
bit values at any one time and outputs either a zero or one, depending on the inputs. The output of an OR gate is one, except when the two bit 
values that it accepts are both zero, in which case the output is zero. The operation on this system actually consists of sequentially inputting 
the values of two bit strings into the OR gate. The result will be a new string of five O's and I's. An alternate method of operating in this 
system is to use five OR gates and to input corresponding pairs of bits from the input strings into the gates concurrently. 



System 1 : 

2, 3, 



, 5); UJ 



System 2 : 

Strings of 5 bits with OR 



u 



a: = |i,2) 



11000 



Figure 11.7.2 
Translation between sets and strings of bits 

The Isomorphism: Since each system has only one operation, it is clear that union and the OR gate translate into one another. The translation 
between sets and bit strings is easiest to describe by showing how to construct a set from a bit string. If a i 02 04 05, is a bit string in System 
2, the set that it translates to contains the number k if and only if a^- equals 1 . For example, 10001 is translated to the set 1 1 , 5), while the set 
1 1 , 2) is translated to 1 1 000. Now imagine that your computer is like the child who knows English and must do a Greek problem. To execute 
a program that has code that includes the set expression {1, 2| U |1, 5), it will follow the same procedure as the child to obtain the result, as 
shown in Figure 1 1 .7 .3 . 



11.2) 



11000 



u 



OR 



1 1,5) 



10001 



1 1,2.5 ) 



11001 



Figure 11.7.3 
Translation of a problem in set theory 

Example 11.7.3. IVIultiplying without doing multiplication. This isomorphism is between [R+ ; •] and [IR;-f-]. Until the 1970s, when the 
price of calculators dropped, multiplication and exponentiation were performed with an isomorphism between these systems. The isomor- 
phism (R+ to R) between the two groups is that • is translated into + and any positive real number a is translated to the logarithm of a. To 

translate back from IR to R* , you invert the logarithm function. If base ten logarithms are used, an element of R, b, will be translated to 10*. 
In pre-calculator days, the translation was done with a table of logarithms or with a slide rule. An example of how the isomorphism is used 
appears in Figure 1 1 .7.4. 
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125 



1000 



0.90309 



2.09691 = 3.000000 

Figure 11.7.4 
Multiplication using logarithms 



The following definition of an isomorphism between two groups is a more formal one that appears in most abstract algebra texts. At first 
glance, it appears different, it is really a slight variation on the informal definition. It is the common definition because it is easy to apply; that 
is, given a function, this definition tells you what to do to determine whether that function is an isomorphism. 

Procedure for showing that two groups are isomorphic 

Definition: Group Isomorphism. lf[Gi ; *i] and [G2 : *2] are groups, f : G\ -> G2 is an isomorphism from Gi into Gi if: 

(a) fis a bijection, and 

(b) f(a *i b) = f(d) *2f(b) for all a, b e Gi 
If such a function exists, then G\ is isomorphic to G2- 

Notes: 

(a) There could be several different isomorphisms between the same pair of groups. Thus, if you are asked to demonstrate that two groups 
are isomorphic, your answer need not be unique. 

(b) Any application of this definition requires a procedure outlined in Figure 11 .7 .5 . 



Define a function T.-Gi^Gi and 
prove that T is an isomorpiiism. 



Prove that 7'"' 






Prove that 


is a bijection. 


T(a *i b)^ T{a) ^2 T{b) for all a,b e Gj 




Prove ttiat T 
is onto. 



Prove ttiat T 
is one-to-one. 



Figure 11.7.5 
Steps in proving that dand G2 are isomorphic 

The first condition, that an isomorphism be a bijection, reflects the fact that every true statement in the first group should have exactly one 
corresponding true statement in the second group. This is exactly why we run into difficulty in translating between two natural languages. To 
see how Condition (b) of the formal definition is consistent with the informal definition, consider the Function L : ^ R defined by 
L{x) = logigX. The translation diagram between R+ and R for the multiplication problem a ■ b appears in Figure 11.7.6. We arrive at the 

same result by computing L"' (L(a) + L{b)) as we do by computing a • ^. If we apply the function L to the two results, we get the same 
image: 

L(a ■ b) = L(L-HL(a) + L(b))) = L(a) + L(b) (11.7a) 
since L(L"' (x)) = x. Note that 1 1 .7a is exactly Condition b of the formal definition applied to the two groups R+ and R . 
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Ha) 



m 



L-\Ua * b)) 

A 

Ua-b) 

Figure 11.7.6 
Multiplication using logarithms - general situation 



Example 11.7.4. Consider G=|(q jj|aelR| with matrix multiplication. This group [R; +] is isomorphic to G. Our translation rule is 



the function / : R -> G defined by f(a) ■ 



1 a 
1 



Since groups have only one operation, there is no need to state explicitly that addition is 
translated to matrix multiplication. That/is a bijection is clear from its definition. If a and b are any real numbers, 

_ / 1 a + b\ 

" lo 1 j 

= f{a + b) 

We can apply this translation rule to determine the inverse of a matrix in G. We know that a + (-a) = is a true statement in R. Using /to 
translate this statement, we get 

mn-a) = m 

or 



I 1 a \l 1 -a\_ll 

lo 1 jlo 1 j"lo 1 



therefore. 



( 



1 a 
1 



1 -a 
1 



Theorem 11.7.1 summarizes some of the general facts about group isomorphisms that are used most often in applications. We leave the proof 
to the reader. 

Theorem 11.7.1. lf[G;*\ and [H, o] are groups with identities e and e\ respectively, and T : G H is an isomorphism from G into 
H, then: 

(a) T (e) = e ', 

(b) T(a)-' = T[a-') for all a eG,and 

(c) If K is a subgroup of G, then T (K) = {T(a) : a e K] is a subgroup of H and is isomorphic to K. 

"Is isomorphic to" is an equivalence relation on the set of all groups. Therefore, the set of all groups is partitioned into equivalence classes, each 
equivalence class containing groups that are isomorphic to one another. 



Procedures for showing groups are not isomorphic 

How do you decide that two groups are not isomorphic to one another? The negation of "G and H are isomorphic" is that no translation rule 

between C and H exists. If G and H have different cardinalities, then no bijection from G into H can exist. Hence they are not isomorphic. 
Given that |G| = \H\, it is usually impractical to list all bijections from G into H and show that none of them satisfy Condition b of the formal 
definition. The best way to prove that two groups are not isomorphic is to find a true statement about one group that is not true about the other 
group. We illustrate this method in the following checklist that you can apply to most pairs of non-isomorphic groups in this book. 

Assume that [G; *] and [H; o] are groups. The following are reasons for G and H to be not isomorphic. 

(a) G and H do not have the same cardinality. For example, Z12 x Z5 can't be isomorphic to Z50 and [R; +] can't be isomorphic to [Q* ; •], 

(b) G is abelian and H is not abelian since a * b = b ■* ai& always true in G, but T(a) T (b) = T (b) o T (a) would not always be true. Two 
groups with six elements each are Zg and the set of 3 x 3 rook matrices (see Exercise 5 in Section 11.2). The second group is non-abeUan, 
therefore it can't be isomorphic to Zg . 

(c) G has a certain kind of subgroup that H doesn't have. Theorem 11 .7.1(c) states that this cannot happen if G is isomorphic to H. [R* ; ■] and 
[R+ ; •] are not isomorphic since R* has a subgroup with two elements, {-1, 1), while the proper subgroups of R+ are all infinite (Convince 
yourself of this fact!). 

(d) The number of solutions of x * x = e in G is not equal to the number of solutions of y ^ y = e' in H. Zg is not isomorphic to Z2'' since 
X +i X = has two solutions, and 4, while y + y = (0, 0, 0) is true for all y e Z2'. If the operation in G is defined by a table, then the 
number of solutions of x * x = e will be the number of occurrences of e in the main diagonal of the table. The equations x^ = e, x* = e, ... 
can also be used in the same way to identify non-isomorphic groups. 
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(e) One of the cyclic subgroups of C equals G (i. e., G is cyclic), while none of //'s cyclic subgroups equals H (i. e., H is noncyclic). This is a 
special case of Condition c. Z and Z x Z are not isomorphic since Z = (1) and Z x Z is not cyclic. 

EXERCISES FOR SECTION 11.7 

A Exercises 

1 . State whether each pair of groups below is isomorphic. If it is, give an isomorphism; if it is not, give your reason. 

(a) Z X R andR x Z 

(b) ZjXZ andZ X Z 

(c) IR and Q X Q 

(d) 1 , 2)) with symmetric difference and Z2^ 

(e) Z2^andZ4 

(f) and M2x2(IR) with matrix addition 

(g) r2 X R+ 

(h) Z2 and the 2x2 rook matrices 

(i) Zg and Z2X Z3 

2. If you know two natural languages, show that they are not isomorphic. 

3. Prove that the relation "is isomorphic to" on groups is transitive. 

4. (a) Write out the operation table for G = [{1, — 1, i, -i), ■] where i is the complex number for which f = - I. Show that G is isomor- 
phic to [Z4; +4]. 

(b) Solve = -1 in G by first translating to Z4 , solving the equation in Z4 , and then translating back to G. 

B Exercises 

5. It can be shown that there are five non-isomorphic groups of order eight. You should be able to describe at least three of them. Do so 
without use of tables. Be sure to explain why they are not isomorphic. 

6 . Prove Theorem 1 1 .7 . 1 . 

7. Prove that all infinite cyclic groups are isomorphic to Z. 

8. (a) Prove that R' is isomorphic to Z2 x R. 

(b) Describe how multiplication of nonzero real numbers can be accomplished doing only additions and translations. 

9. Prove that if G is any group and g is some fixed element of G, then the function defined by 4>g(x) = g*x*g~^ is an isomorphism from G 
into itself. An isomorphism of this type is called an automorphism. 
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1 11.8 Using Computers to Study Groups 

Groups in Mathematica 

Mathematica has a wide variety of computable databases available and one of them is on finite groups. To access the database you use the 
function FiniteGroupData. Extensive documentation is available at . Since we've only scratch the surface of group theory at this point, 
most of the groups and concepts mentioned are likely to be unfamiliar to the reader. For this reason, we well wait until Chapter 15 to discuss 
that database. 

The Combinatorica package that is included in all Mathematica distributions has limited abstract algebra 
<< Combinatorica" 

Here is how to generate the body of the operation table for the ring [Z7; +7]. Notice that this really an addition table even though the function 
that creates the table is called MultiplicationTable. 

MultiplicationTable [Range [0, 6], Function[{a, b}, Mod[a + b, 7]]] 

/ 1 2 3 4 5 6 7 \ 

2 3 4 5 6 7 1 

3 4 5 6 7 1 2 

4 5 6 7 1 2 3 

5 6 7 1 2 3 4 

6 7 1 2 3 4 5 
,7123456, 

An even more user-friendly package that you would need to download to use is available at Exploring Abstract Algebra with Mathematica 
(http://www.central.edu/EAAM/). The package, when installed on your computer, is loaded with the command 

<< AbstractAlgebra^Master" 

The group Z12 is 

6 = Z6[12] 

Groupoid({0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11}, (ttl + tt2)mod 12 &) 
At this point 6 is an object that consists of the set {0, 1 , 2, . . . , 1 1} and the binary operation +12. Among things we can do with 6 is that we can 

examine its subgroups. 

Subgroups [G] 

{Groupoid((0), (ttl + tt2) mod 12 &), Groupoid((0, 2, 4, 6, 8, 10), (ttl + tt2) mod 12 &), 
Groupoid({0, 3,6,9}, (ttl + tt2) mod 12 &), Groupoid({0, 4, 8}, (ttl + tt2) mod 12 &), 

Groupoid((0, 6), (ttl +tt2)modl2&), Groupoid((0, 1, 2, 3,4, 5, 6, 7, 8, 9, 10, 11), (ttl +tt2)modl2&)) 

We can view the inverses of elements in a variety of ways. For example, we can get them paired up. Notice that two of the elements, and 6 
invert themselves. 

Inverses [6] 

IQ \ 

1 11 

2 10 

3 9 

4 8 

5 7 

.6 6 , 

There is a "Visual Mode" that gives us a different view of the inverses. The boxes with "?" and "->" give further information in you are reading 
this in a Mathematica Notebook and have the package installed. 
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Inverses [G, Mode -* Visual] 






6 



The package was designed for teaching a first course in abstract algebra and so it has features that are more basic than other abstract algebra 
resources. For example, we can ask G is really a group and get quite a bit of information. 
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GroupQ[G, Mode -> Textual] 

Given a set S and an operation *, we call the pair (S,*) 

a group if S is closed under the operation *, there is an identity 

element, every elemement has an inverse and the operation * is 

associative. 



We say a Groupoid G has an identity e if for all other elements g in G we have e + g = g + e = g (where + indicates the operation). 
In this case, Z[12J has the identity 0. 



We say that a set S is closed under an operation op if whenever we have x and y in S, we also have op[x,y] (or x~op~y) in S. 
In this case, the Groupoid Z[12] is indeed closed. 



Given a Groupoid G, we say an element g in G has an inverse h if G has an identity, say e, and g + h = h + g 
= e (where + indicates the operation). The Groupoid Z[12] has an inverse for every element. Here they are: 



X 


x-1 








1 


11 


2 


10 


3 


9 


4 


8 


5 


7 


6 


6 



r ~ 

Given a structured set S (Groupoid or Ringoid), we say the operation * is 

associative if for every g, h, and k in S we have (g*h)*k = g*(h*k), where * is the group operation. 
In this case, Z[12] is associative. Consider the following table illustrating random 

triples that associate. Pay attention to the last two columns. 



1 


j 


k 


(i*j)*k 


i*(j*k) 


2 


11 


4 


5 


5 


3 


1 


4 


8 


8 


8 


9 


8 


1 


1 


7 


5 


2 


2 


2 


8 


8 


2 


6 


6 


10 


6 


9 


1 


1 


4 


9 


7 


8 


8 


4 


5 


2 


11 


11 


11 


4 


1 


4 


4 


5 


10 


6 


9 


9 



This package also has much more capabilities than what we've covered so far and we will revisit it in Chapters 15 and 16. 

Groups in Sage 

Abstract Algebra seems to have been given a much higher priory in the design of Sage than it was in Mathematica. Again, the capabilities far 
exceed what we've touch on in the theory, but here are a few examples that you should understand. Here is how to generate the group related to 
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Z,4. 



G=AbelianGroup( 1 , [ 14 ] ) 
G. list( ) 

[1, f, f"2, £"3, f"4, £"5, f"6, fl, f"8, £"9, flO, fll, f"12, f^lS] 

There is no output from assigning G. The elements of G are generated from the list method. The connection with Z14 is that when we 
muhiply powers of f , the exponents are added with +14. Among other things we can ask whether G is abelian and what its subgroups are. 



G. is_abelian ( ) 
True 

G. subgroups ( ) 

[Multiplicative Abelian Group isomorphic to C2 x C7, which is the 
subgroup of 

Multiplicative Abelian Group isomorphic to C14 

generated by [f]. Multiplicative Abelian Group isomorphic to C7, which 
is the subgroup of 

Multiplicative Abelian Group isomorphic to C14 

generated by [f^2]. Multiplicative Abelian Group isomorphic to C2, which 
is the subgroup of 

Multiplicative Abelian Group isomorphic to C14 

generated by [f^7]. Trivial Abelian Group, which is the subgroup of 
Multiplicative Abelian Group isomorphic to C14 
generated by [ ] ] 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-ShareAlike 3.0 United States License. 



288 



SUPPLEMENTARY EXERCISES FOR CHAPTER 11 
Section 11.1 

1. V= {a,b, c} is a set with operations + and ■ defined by the following "addition" and "multiplication" tables: 



a b c 



a b c a 
b c a b 
cab c 



a b c 



a a a 
a b c 
a c b 



a, h, c e [r| 



(a) With respect to V under + determine, 

(i) The identity (i.e., the "zero" of the addition). 

(ii) The inverse of each element, that is, -a, -b, and -c. 

(b) With respect to V under • determine, 

(i) The identity (i.e., the "one" of the multiplication). 

(ii) The inverse of each element different from "zero." 

(c) Is + distributive over ■ ? Is ■ distributive over + ? 

2. (a) Determine whether the following are valid binary operations on the given sets. Explain fuUy. 

i( a h 

(i) Matrix addition on A = < 

\\c 

(ii) Matrix multiplication on the set A above. 

(iii) On Q+, define * hy a * h = (a ■ h) /2. 

(iv) Function composition on A'' = {/: A -> A}, where A is {1, 2, 3}. 

(v) Function composition on B = {/ e A'* | / is a bijection} . 

(b) For each binary operation above give the identity element if it exists. Explain. 

(c) Determine which of the above binary operations are commutative and which are associative. 

3. Let S = set of all bijections of a set A, and let ° be function composition. Does ° have the inverse property? Does function composition have 
the involution property? Explain. 

4. Does + on M2x2(.^) have the inverse property? Does + have the involution property? Explain. 

5. Prove that the odd integers are closed under multiplication but not under addition. Are the even integers closed under both addition and 
multiplication? Prove your answers. 

Section 11.2 

6. (a) Show that is a group under componentwise addition, that is, 

(ai, 02) + (bi, bi) = (ai + 02, bi + 62)- 

(b) Show that {(x, 2 x) | x e R} is a group under componentwise addition. Draw the graph of this subset. Describe similar 

subsets of that are also groups. 

7. Prove that the set of all 2 x 2 invertible matrices (over R) is a group under matrix multiplication. Assume, as indicated in Chapter 5, that the 
associative law is true for matrices under multiplication. This group is called the general linear group of degree 2 over R, and it is denoted by 
GL(2, R). It is given this name because these matrices are matrix representations of linear motions of R^ . 

8. Prove that = ^ ^ ^ j det A = l| is a group imder matrix multiplication. Assimie that the associative law is true under matrix multi- 
plication. This group is called the special linear group of degree 2 over R and it is denoted SL(2, R). 

9. Show that R is a group under the operation * defined by a * b = a + b + 5 iot a, b & 

10. (a) let 63x3 be the set of all 3 x 3 Boolean (adjacency) matrices discussed in Section 6.4. Is 53^3 a monoid under Boolean addition? Is it a 
group? Explain. 

(b) Is B3x3 a monoid under Boolean multipMcation? Is it a group? Explain. 
Section 113 
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11. Define * on Q"^ by a * = (a • b)/2. Prove that [Q"^ ; *] is a group. 

12. Let G be the group R under the operation a * b = a + b + 5fora, b e R. Solve the following equations for in G. 

(a) ;c * 3 = 5 (d) = 2 

(b) 2 * X * 4 = 6 (e) 4 * x^ = 5 

(e) jc^ = 7 

13. Solve the equation A*X ■* B = Cin GL(2, R) where 

-(os). -(^i). - -(»;)■ 

14. Prove that if [G; *] is a group, (a * b)" = a" * b" for aH n > 1 and a, e G if and only if [G;*] is an abeUan group. 
Section 11.4 

15. Calculate the following in Z5: 

(a) 3+58 

(b) (-3) X5 2 

(c) (3 X5 2) +5 (2 X5 2) 

(d) 2 ' (i.e., the multipUcative inverse of 2) 

16. (a) Prove that {1, 3, 5, 7}, is a group under Xg . Write out its group table. 

(b) Let U (Z„) stand for the elements of Z„ , which have inverses under x„ . Convince yourself that U(Z„ ) is a group under x„ . 

(c) Prove that the elements of U(Z„ ) are those elements a e Z„ such that gcd(a, n) = 1. You may use the fact that gcd(a, = 1 <=> 
there exist integers i and t such that sa + tb = I. 

Section US 

17. (a) Recall from "Supplementary Exercises," Section 1 1 .4, that ?7(Zg )is a group under Xg . List all cycUc subgroups of this group, 
(b) Is U(Zg ) a cycUc group? Explain. 

18. (a) Use Theorem 1 1 .5 . 1 to prove that the set of even integers is a subgroup of the group Z (under +) . 
(b) Is the set of odd integers a subgroup of the group Z (under +)? 

19. Prove that SL(2, R) is a subgroup of GL(2, R). See Exercises 7 and 8 above for an explanation of this notation. 

20. Recall that M2x2{R) is a group under addition. 
'a b 



(a) 



Is A = 1^ ° ^ j a, & e r} a subgroup of M2x2 (R)? 



(b) IsB 



a b 
h 1 



a,h e R } a subgroup of M2X2 (R)? 



(c) Are either of the subsets in parts a and b subgroups of GL(2, R)? 

21 . Let B3x3 be the monoid of all 3 x 3 Boolean matrices, imder Boolean addition. Let 5 be a subset of £3x3 consisting of all 3 X 3 matrices 
that represent symmetric relations. Is S a submonoid of 63x3 ? 

Section 11.6 

22. Using the data structure in the text for doubly linked lists with six-bit addresses, what are the addresses of the records containing A and D? 
Write your answer as a sum in the group Z^ and then as an address. 



7 


011100 


000011 


7 


A 




B 




c 




D 






010101 




001011 







23. Determine the inverse of each element in the respective group. 

(a) (2, 3, 5) in Z3 X Z7 X Z25 

(b) (1,0, 1,1) in Z'' 
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(c) (3, 2) in IR+ X Zg 

(d) (2,3,5)inlR3 

24. Determine the identity elements in the following groups: 

(a) IR+XR+ 

(b) IR+XZ3 

(c) GL(2, R) X 

25. Which of the following groups are abeUan? Explain. 

(a) Z2XZ24XZ75 

(b) GL(2, R) xZj 

(c) Z« 

26. Is {0, 3} X {0, 4, 8} a subgroup of Zg x Zg ? Explain. 
Section 11.7 

27. Prove that the cyclic subgroup (4) of Zig is isomorphic to Z4 . 

28. Let G = { &, $, %). Given that [G; *] is a group and that it is isomorphic to the group [Z3; +3] with isomorphism T : G -» Z3 defined 
byr(&) = l,r($) = 2,andr(%) = 0. What are 

(a) $*$ (b) Theidentityof [G; *] 

29. Let U be a set and Pu = {propositions over the set U}. It can be shown that the algebraic system [Py; ~, a , v] is isomorphic to 

[nuy, ; n, U]- 

(a) Explain what this means. 

(b) How does this help you imderstand the language of the algebra of propositions? 

(c) Give the "propositional" analogue to the following statement: If A H = and A f] B = then A = 0. 

30. Write out the operation tables for the following systems: 

(a) [{0, 1}; +, ■] where + and ■ denote Boolean addition and multiplication. 

(b) [{-1, 1}; A, v] where A y and V y denote the largest and smallest, respectively, of andy. 

(c) [Z2; +2, X2]. 

Are these systems isomorphic? Explain. 

31 . Prove that the group C, under +, is isomorphic to the group R^ , under + . 

32. Determine which of the following groups are isomorphic. Explain. 

(a) R3 , the 3 x 3 rook matrices, and Z^ 

(b) R3and5A = (/ e A" : /is abijection), where A is {1, 2, 3}. 

(c) Z(, and U(Zj) 

33. Prove that R'' imder addition, is isomorphic to M2x2(IR), vmder addition. 

34. Prove that the group [U(I.g); Xg] is isomorphic to [Z4; +4 ] . 



Applied Discrete Structures by A. Doerr & K. Levasseur is licensed under a Creative Commons Attribution-NonConmiercial-ShareAlike 3.0 United States License. 



291 



chapter 12 




MORE MATRIX ALGEBRA 



GOALS 

In Chapter 5 we studied matrix operations and the algebra of sets and logic. We also made note of the strong resemblance of matrix algebra to 
elementary algebra. The reader should briefly review this material. In this chapter we shall look at a powerful matrix tool in the applied 
sciences— namely, a technique for solving systems of linear equations. We will then use this process for determining the inverse oi nxn 
matrices, n > 2, when they exist. We conclude by a development of the diagonalization process, with a discussion of several of its applications. 

1 12.1 Systems of Linear Equations 

The method of solving systems of equations by matrices that we will look at is based on procedures involving equations that we are familiar 
with from previous mathematics courses. The main idea is to reduce a given system of equations to another simpler system that has the same 
solutions. 

Definition: Solution Set. Given a system of equations involving real variables Xj, X2 x„, the solution set of the system is the set of n- 

tuples in R", (aj, a2 a„) such that the substitutions x\ = a\ , X2 = a2, ■ ■ ., x^ = an make all the equations true. 

In terms of logic, a solution set is a truth set of a system of equations, which is a proposition over n-tuples of real numbers. 

In general, if the variables are from a set 5, then the solution set will be a subset of S" . For example, in number theory mathematicians study 
Diophantine equations, where the variables can only take on integer values instead of real values. 

Definition: Equivalent Systems of Equations. Two systems of linear equations are called equivalent if they have the same set of 
solutions. 

Example 12.1.1. The previous definition tells us that if we know that the system 

4xi + 2x2 + X3 = 1 
2xi + X2 + X3 = 4 
2xi + 2x2 + X3 = 3 

is equivalent to the system 

X'l + 0x2 + 0x3 = -1 
Oxi + X2 + 0X3 = -1 
Oxi + 0x2 + X3 = 7 

then both systems have the solution set {(-1, -1, 7)). In other words, the values xi = -1, X2 = -1, and X3 = 7 are the only values of the 
variables that make all three equations in either system true. 

Theorem 12.1.1. Elementary Operations on Equations. If any sequence of the following operations is performed on a system of 
equations, the resulting system is equivalent to the original system: 

(1) Interchange any two equations in the system. 

(2) Multiply both sides of any equation by a nonzero constant. 
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(3) Multiply both sides of any equation by a nonzero constant and add the result to a second equation in the system, with the sum replacing 
the latter equation. 

Let us now use the above theorem to work out the details of Example 12.1.1 and see how we can arrive at the simpler system.. 

Step 1 . We will first change the coefficient of xi in the first equation to one and then use it as a pivot to obtain O's for the coefficients of xi in 
Equations 2 and 3. 

Ax\ +2x2+ x-i = \ 

(1.1) 2 xi + X2+ x-i=A Multiply Equation 1 by ^ to obtain 

IXi +2X2+ X3=3 
x-i 1 

Xi + — + - = - 

'244 

(1 .2) 2 xi + X2 + X3 = 4 Multiply Equation 1 by - 2 and 
2 xi + 2 X2 + X3 = 3 

add the result to Equation 3 to obtain 

X2 X^ 1 

(1 .3) xi + Xi + — = - Multiply Equation 1 by - 2 and add 

1 ^22 

2 xi + 2 X2 + X3 =3 
the result to Equation 3 to obtain 

X2 X3 1 

X| + — + — = - 
' 2 4 4 

(1.4) Oxi + 0x2+ f = ^ 

n X^ 5 

Xi + X2+ ^ = - 

Note: We've explicitly written terms with zero coefficients such as xi to make a point that aU variables can be thought of as being 
involved in aU equations. After this example we wiU discontinue this practice in favor of the normal practice of making these terms 
"disappear." 

Step 2. We would now like to proceed in a fashion analogous to Step 1 — namely, multiply the coefficient of X2 in the second equation by a 
suitable number so that the result is 1. Then use it as a pivot to obtain O's as coefficients for X2 in the first and third equations. This is clearly 
impossible (Why?), so we wiU first interchange Equations 2 and 3 and proceed as outlined above. 

xi + ^ + ^ = i 

' 2 4 4 



(2.1) xi + X2 + Y = J Interchange Equations 2 and 3 to obtain 

r\ X-j $ 

0X1+ ■«2+ 7 =- 

Xi + ^ + ^ = i 

' 2 4 4 

(2.2) xi + X2 + Y = I Multiply Equation 2 by - j and add 
xi + X2 + — = - result to Equation 1 to obtain 



(2.3) 



X\ + 0X2+ Oxo, = -\ 
Oxi +X2 +Y=2 
Oxi + 0X2 + 



7 

2 2 



Step 3. Next, we wiU change the coefficient of X3 in the third equation to one and then use it as a pivot to obtain O's for the coefficients of X3 in 
Equations 1 and 2. 

Xi + 0X2 + 0X3 = -1 

(3.1) Oxi +-^2 +7= J Multiply Equation 3 by 2 to obtain 
Oxi +0x2 + f-= ^ 



Xi + X2 + X3 = - 1 

V, 5 

)X| + Xi + — = - 
I - 2 2 

Oxi + 0x2 +X3 = 7 to Equation 2 to obtain 



(3.2) Oxi +X2 Multiply Equation 3 by - i and add the result 
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Xi +0X2 +0X3 = -1 

(3.3) Oxi +X2 +0x3 = -1 

Xi + X2 + X3 = 7 

From the system of equations in Step 3.3, we see that the solution to the original system (Step 1.1) is xi = -1,X2 = -1, andxs = 7 . 

In the above sequence of steps, we note that the variables serve the sole purpose of keeping the coefficients in the appropriate location. This we 
can effect by using matrices. The matrix of the system given in Step 1 .1 is 

Main 

2 114 
,2213 

where the matrix of the first three columns is called the coefficient matrix and the complete matrix is referred to as the augmented matrix. Since 

we are now using matrices to solve the system, we will translate Theorem 12.1.1 into matrix language. 

Definition: Elementary Row Operations. The following operations on a matrix are called elementary row operations: 

(1) Interchange any two rows of the matrix. 

(2) Multiply any row of the matrix by a nonzero constant. 

(3) Multiply any row of the matrix by a nonzero constant and add the result to a second row, with the sum replacing the second row. 

Definition: Row Equivalent. Two matrices, A and B, are said to be row-equivalent if one can be obtained from the other by any one 
elementary row operation or by any sequence of elementary row operations. 

If we use the notation Rj to stand for Row i of a matrix and — > to stand for row equivalence, then 

cRi+Rj 

A >B 

means that the matrix B is obtained from the matrix A by multiplying the Row of A by c and adding the result to Row j. The operation of 
multiplying row by c is indicated by 

cR, 

A — 

while interchanging rows ; and j is denoted by 

Rl^R: 

A >B. 

The matrix notation for the system given in Step 1 .1 with the subsequent steps are: 



I A 


2 


1 


1 ^ 


-fii 


1 


1 


1 


i ^ 






2 


4 


4 


2 


1 


1 


4 




2 


1 


1 


4 


.2 


2 


1 


3 , 




,2 


2 


1 


3 , 







2 4 
1 



1_ \ 

4 
7 
2 



U 2 1 3 J 



-2«, + «3 



2 





1 



1 i i 

4 4 

\_ 7 

2 2 

1 5 



-R2+R1 



2 2) 
1 

1 i 

2 

7 



V 



1 

1 




1 i i 

2 4 4 
]_ 5 

2 2 

1 1 

2 2 



-1 ^ 

5 
2 
7 

2 I 
-1 ^ 



2R3 



1 -1 \ 



1 - 

2 

lO 1 



10-1 
,0017 

This again gives us the solution. This procedure is called the Gauss-Jordan elimination method. 

It is important to remember when solving any system of equations via this or any similar approach that at any step in the procedure we can 
rewrite the matrix in "equation format" to help us to interpret the meaning of the augmented matrix. 

In Example 12.1.1 we obtained a unique solution, only one triple, namely (-1, -1, 7), which satisfies all three equations. For a system 
involving three unknowns, are there any other possible results? To answer this question, let's review some basic facts from analytic geometry. 

The graph of a linear equation in three-dimensional space is a plane. So geometrically we can visualize the three linear equations as three planes 

in three-space. Certainly the three planes can intersect in a unique point, as in Example 12.1.1, or two of the planes could be parallel. If two 
planes are parallel, there are no common points of intersection; that is, there are no triple of real numbers that will satisfy all three equations. 
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Also, the three planes could intersect along a common axis or line. In this case, there would be an infinite number of real number triples in 
that would satisfy all three equations. Finally if all three equations describe the same plane, the solution set would be that plane. We 
generalize; 

In a system of n linear equations, n unknowns, there can be: 

(1) a unique solution, 

(2) no solution, or 

(3) an infinite number of solutions. 

To illustrate these points, consider the following examples: 
Example 12.1^. Find all solutions to the system 

Xi +3X2 + X3 = 2 

xi + X2 + 5 X3 = 4 
2xi + 2x2 + 10x3 = 6 
The reader can verify that the augmented matrix of this system. 



reduces to 



f 1 


3 


1 


2 \ 


1 


1 


5 


4 


a 


2 


10 


6 , 


f 1 


3 


1 


2 \ 


1 


1 


5 


4 


.0 








-2 , 



(See exercise 4 of this section.) 



We can row-reduce this matrix further if we wish. However, any further row-reduction will not substantially change the last row, which, in 
equation form, is Oxi -1- 0x2 -1- 0x3 = -2, or simply = -2. It is clear that we caimot find real numbers xi, X2, and X3 that will satisfy this 
equation, hence we cannot find real numbers that wiU satisfy all three original equations simultaneously. When this occurs, we say that the 
system has no solution, or the solution set is empty. 

Example 12.13. Next let's attempt to find all of the solutions to: 

Xi -F 6 X2 -F 2 X3 = 1 

2xi + X2 + 3 X3 = 2 
4xi -F 2x2 ^- 6x3 = 4 

The augmented matrix for the system. 



reduces to 



f 1 


6 


2 


1 ' 


2 


1 


3 


2 


.4 


2 


6 


4 , 



1 





16 


1 ' 






11 







1 


1 









11 




.0 








, 



If we apply additional elementary row operations to this matrix, it will only become more complicated. In particular, we cannot get a one in 
the third row, third column. Since the matrix is in simplest form, we will express it in equation format to help us determine the solution set. 



Xi 



16 

-I- — X3 = 1 

11 ^ 

X2+ ^X3 =0 



Any real numbers wiU satisfy the last equation. However, the first equation can be rewritten as xi 



- xa, which describes the coordi- 



nate xi in terms of X3 . Similarly, the second equation gives xiin terms of X3 .A convenient way of listing the solutions of this system is to use 
set notation. If we call the solution set of the system S, then 



= {(1 - ^Xi, -J^Xi, X3) I X3 I 



What this means is that if we wanted to list all solutions, we would replace X3 by all possible numbers. Clearly, there is an infinite number of 
solutions, two of which are (1 , 0, 0)and(-15, -1, 11). 

A Word Of Caution; Frequently we may obtain "different-looking" answers to the same problem when a system has an infinite number 
of answers. Assume a student's solutions set to Example 12.1.3 is A = {(1 + 16x2, X2, -11 X3) | X3 e R). Certainly the result described by 
S looks different fi-om that described by A. To see whether they indeed describe the same set, we wish to determine whether every solution 
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produced in S can be generated in A. For example, the solution generated by S when = 11 is (-15, -1, 11). The same triple can be 
produced by A by taking X2= -1. We must prove that every solution described in S is described in A and, conversely, that every solution 

described in A is described in S. (See Exercise 6 of this section.) 

To summarize the procedure in the Gauss-Jordan technique for solving systems of equations, we attempt to obtain I's along the main diagonal 
of the coefficient matrix with O's above and below the diagonal, as in Example 12.1.1. We may find in attempting this that the closest we can 
come is to put the coefficient matrix in "simplest" form, as in Example 12.1.3, or we may find that the situation of Example 12.1.1 evolves as 
part of the process. In this latter case, we can terminate the process and state that the system has no solutions. The final matrix forms of 
Examples 12.1.1 and 12.1.3 are called echelon forms. 

In practice, larger systems of linear equations are solved using computers. Generally, the Gauss-Jordan algorithm is the most useful; however, 
slight variations of this algorithm are also used. The different approaches share many of the same advantages and disadvantages. The two major 
concerns of all methods are: 

(1) minimizing inaccuracies due to rounding off errors, and 

(2) minimizing computer time. 

The accuracy of the Gauss- Jordan method can be improved by always choosing the element with the largest absolute value as the pivot element, 
as in the following algorithm. 

Algorithm 12.1.1. Given a matrix equation Ax = b, where A is n x m, let C be the augmented matrix [A | fo]. The process of row- 
reducing to echelon form involves performing the following algorithm where C,- = the row of C: 

i = 1 

j = 1 

while (i < n and j < m) : 

# Find pivot in column j, starting in row i: 
max! = i 

for k = i+1 to n: 

if abs(C[k,j]) > abs(C[maxi, j ] ) then 
max! : = k 
if C[maxi,j] f then 

interchange rows i and maxi 
divide each entry in row i by C[i,j] 
# Now C[i,j] will have the value 1. 
for u = i+1 to n: 

subtract C[u, j] * Ci from Cu 
# Now C[u,j] will be 
i := i + 1 
end if 
j = j + 1 
end while 



At the end of this algorithm, with the final form of C you can revert back to the equation form of the system and a solution should be clear. In 

general, 

(a) If any row ofC is all zeros, it can be ignored. 

(b) If any row of C has all zero entries except for the entry in the (m + 1)" position, the system has no .volution. Otherwise, if a column has 
no pivot, the variable corresponding to it is a free variable. Variables corresponding to pivots are basic variables and can be expressed in 
terms of the free variables. 



Example 12.1.4. If we apply Algorithm 12.1.1 to the system 

5 Xi + X2 + 2X3 + X4 = 2 

3xi + X2 - 2x3 = 5 

Xl + X2 + 3 X3 - X4 = -1 

the augmented matrix 





(5 


1 


2 


1 


2 \ 


c = 


3 


1 


-2 





5 




, 1 


1 


3 


-1 


-1 . 



is reduced to a new value of C: 

10 
<^= 1 - 



1 i ^ 

2 2 
3 3 
2 2 

10 1 -1 ; 
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therefore X4 is a free variable in the solution and general solution of the system is 

/■ 1 1 



X3 
X4 ) 



- X4 



3 3 
- + - Xa 
2 2 

-1 

X4 



This conclusion is easy to see if you revert back to the equations that the final value matrix C represents. 

:^|^ Mathematica Note 

The Mathematica function RowReduce does the same reduction as described in Algorithm 12.1.1. For example, here is the result for the 
system in Example 12.1 .4. 

2 

1 



RowReduce 



5 1 
3 1 
1 1 



1 


-1 



■1 I 



< 1 











1 





^0 





1 



3 
2 





I \ 

2 
3 
2 

-1 ; 



Options [RowReduce] 

{Method Automatic, Modulus 0, Tolerance -> Automatic, ZeroTest -> Automatic) 

Only one caution: One needs to be aware that if the pivoting process continues into the last column, which Mathematica will do, there will not 
be a solution to the system. For example the system 

2xi - X2= I 

3x2 - xi =5 
x\ + 5 X2 = 7 
has augmented matrix 

12 -1 1 \ 
C= -1 3 5 . 
.1 5 7, 
Here is the computation to row-reduce: 



I 2 



RowReduce 



1 
5 

7 j 



1 

.001, 

The last row of the final form of C is = 1 and so there is no solution to the original system. 

pTi^ Sage Note 

Given an augmented matrix, C, there is a matrix method called eschewing_f orm that can be used to row reduce C. Here is the result for 
the system in Example 12.1.4. In the assignment of a matrix value to C, notice that the first argument is QQ, which indicates that the entries 
should be rational numbers. As long as all the entries are rational, which is the case here since integers are rational, the row-reduced matrix 
will be all rational. 



C = Matrix(QQ, [ [5,1,2 


1,2] 


, [3,1,-2 


,0,5] , [1,1,3,-1,-1] ] ) 


C. echelon f orm( ) 








[ 1 





1/2 


1/2] 


[ 1 





-3/2 


3/2] 


[ 


1 





-1] 



If we didn't specify the set from which entries are taken, it would assumed to be the integers and we would not get a fully row-reduced matrix. 
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The next step would involve multiplying row 3 by -, which isn't an integer. 



C2 = Matrix([[5,l,2,l,2],[3,l,-2,0,5],[1,1,3,-1,-1]]; 
C2 . echelon_f orm( ) 



[ 


1 


1 


3 


-1 


-1] 


[ 





2 


2 


-3 


1] 


[ 








9 





-9] 



This is why we would avoid specifying real entries: 



C3 



Matrix (RR,[ [5,1,2,1,2] ,[3,1,-2,0,5] ,[1-1-3,-1,-1] ] ; 



C3 . echelon_f orm( ) 

[ 1.00000000000000 
[ 0.000000000000000 
[ 0.000000000000000 



0.000000000000000 
1.00000000000000 
0.000000000000000 



0.000000000000000 
0.000000000000000 
1.00000000000000 



0.500000000000000 
-1.50000000000000 
.93432455388958e-17 



0.500000000000000] 
1.50000000000000] 
-1.00000000000000] 



This is the default number of decimal places, which could be controlled and the single small number in row three column four isn't exactly zero 
because of round-off and we could just set it to zero. However, the result isn't as nice and clean as the rational output in this case. 

EXERCISES FOR SECTION 12.1 
A Exercises 

1 . Solve the following systems by describing the solution sets completely: 

2xi +X2 = 3 

(a) , 

2 xi + X2 + 3 X3 = 5 

(b) 4xi + X2 + 2x3 = -1 
8x1 + 2x2 + 4x3 = -2 

Xi + X2 + 2 X3 = 1 

(c) Xi + 2 X2 - X3 = - 1 

xi + 3 X2 + X3 = 5 

, xi - X2 + 3 X3 = 7 
X\ + 3 X2 + X3 = 4 

2. Solve the following systems by describing the solution sets completely: 

2xi + 2x2 + 4x3 = 2 

(a) 2xi + X2 + 4x3 = 

3 xi + 5 X2 + X3 = 

2xi + X2 + 3x3 = 2 

(b) 4xi + X2 + 2x3 = -1 
8x1 + 2 X2 + 4 X3 = 4 

x\ + X2 + 2 X3 + X4 = 3 

(c) xi - X2 + 3 X3 - X4 = -2 

3 xi + 3 X2 + 6 X3 + 3 X4 = 9 
6x1 + 7x2 + 2x3 = 3 

(d) 4xi + 2x2 + X3 = -2 

6x1 + X2 + X3 = 1 

X\ + X2 - X3 + 2 X4 = 1 

(e) xi + 2 X2 + 3 X3 + X4 = 5 

X\ + 3X2 + 2X3-X4 = -1 

3. Given that the final augmented matrices below obtained from Algorithm 12.1.1, identify the solutions sets. Identify the basic and free 
variables, and describe the solution set of the original system. 



(a) 



/ 1 -5 1.2 
1 4 2.6 

,0 1 4.5 ; 



(c) 



/' 1 9 3 \ 
10 4 
,0 \ , 
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(b) 



no 6 5 ^ 

1-21 
,0 , 



(d) 



/ 1 -3 1 
10 2 2 
,0 1 -1 I ) 



4. (a) Write out the details of Example 12.1 .2. 

(b) Write out the details of Example 12.1 .3. 

(c) Write out the details of Example 12.1 .4. 

5. Solve the following systems using only mod 5 arithmetic. Your solutions should be n - tuples fi-om Z5 . 

2 xi + X2 = 3 



(a) 



(compare your solution to the system in 5(a)) 



Xl + 4 X2 = 1 
Xi + X2+2xi= I 

(b) xi + 2 JC2 + 4jc3 = 4 
xi + 3 X2 + 3 X3 = 

6. (a) Use the solution set 5 of Example 12.1.3 to list three different solutions to the given system. Then show that each of these solutions can be 
described by the set A of Example 12.1 .3. 

(b) Prove that S = A. 

B Exercise 

7. Given a system of n linear equations in n unknowns in matrix form Ax = b, prove that if 6 is a matrix of all zeros, then the solution set of 
Ax = bis a subgroup of R" . 



I 12.2 Matrix Inversion 

In Chapter 5 we defined the inverse of an n x n matrix. We noted that not all matrices have inverses, but when the inverse of a matrix exists, it 
is unique. This enables us to define the inverse of an n x n matrix A as the unique matrix B such that AB = B A = I , where / is the n x n 
identity matrix. In order to get some practical experience, we developed a formula that allowed us to determine the inverse of invertible 2x2 
matrices. We will now use the Gauss- Jordan procedure for solving systems of linear equations to compute the inverses, when they exist, of nxn 
matrices, n > 2. The following procedure for a 3 x 3 matrix can be generaUzed forn xn matrices, n > 2. 

Example 12.2.1. Given the matrix 

n 1 2 ^ 

A = 2 1 4 

,3 5 1 
we want to find the matrix 

'Xn X12 Xi3 
B= X21 X22 X23 
, X31 X32 X33 

if it exists, such that (a) A B = / and (b) B A = /. We wiU concentrate on finding a matrix that satisfies Equation (a) and then verify that B also 
satisfies Equation (b). 

( X[i X12 Xi3 
X21 X22 X23 

I j 

is equivalent to 

f Xn + X21 + 2 X31 X12 + X22 + 2 X32 Xi3 + X23 + 2 X33 

2xii + X21 + 4x31 2 X12 + X22 + 4 X32 2 xi3 + X23 + 4 X33 = 010 (12.2.a) 
l3xn +5x21 +X31 3x12 + 5x22 +X32 3 xb + 5 X23 + X33 

By definition of equaUty of matrices, this gives us three systems of equations to solve. The augmented matrix of one of the 12.2a systems, the 
one equating the first columns of the two matrices is: 

/ 1 1 2 M 



r 1 


1 


2 > 




2 


1 


4 




,3 


5 


1 , 







( 1 





\ 







1 







,0 





1 . 





( 1 





\ 







1 







,0 





1 . 



2 14 
13 5 1 J 



(12.2.b) 
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Using the Gauss- Jordan technique of Section 12.1, we have: 



n 1 2 1 \ 

2 14 
.3510, 



f 1 



' 1 1 
1 

.0 2 



1 



1 2 
0-10-2 
13 5 1 



1 

2 

-3 J 



Ml 2 
0-10 
.0 2-5 



-R2+R1 

and -2^2+^3 



1 - 

-2 
-3 , 

-1 ^ 



n 2 
0102 
.0 -5 -7 , 



102-1 
0102 

{0 17/5) 



-2R,+Ri 



''100 

1 
1 



-ii ^ 

5 

2 

7 
5 



So xn = -19/5, X21 = 2 and X31 = 7/5, which gives us the first column of the matrix B. The matrix form of the system to obtain xu, X22, and 
x-32 , the second column of B, is: 

n 1 2 Q\ 

2 14 1 (12.2.C) 
>3 5 1 , 
which reduces to 



r 1 
1 
1 



9 

5 

-1 

2 

5 / 



(12.2.d) 



The critical idea to note here is that the coefficient matrix in 12.2c is the same as the matrix in 12.2b, hence the sequence of row operations that 
we used to reduce the matrix in 12.2b can be used to reduce the matrix in 12.2c. To determine the third column of B, we reduce 

( 1 1 2 ^ 

2 14 
,3511 

to obtain X13 = 2/5, X23 = and X33 = -1 /5,. Here again it is important to note that the sequence of row operations used to "solve" this system 
is exactly the same as those we used in the first system. Why not save ourselves a considerable amount of time and effort and solve all three 
systems simultaneously? This we can effect by augmenting the coefficient matrix by the identity matrix /. We then have 



( ^ ^ 

2 1 

,3 5 1 1 



1 ^ 

1 



Same seqiienue ol' row 
operations as above 



1 
1 
1 



19 


9 


- ) 


" 5 


5 


5 


2 


-1 





7 


2 


1 


5 


5 


5 . 



So that 



B = 



19 

5 

2 

7 
5 



9 

5 
-1 
2 
5 



1 

5 J 



The reader should verify that B A = / so that A ^ = B. 

As the following theorem indicates, the verification that BA = Iis not necessary. The proof of the theorem is beyond the scope of this text. 

The interested reader can find it in most linear algebra texts . 

Theorem 122.1. Let A be an n x n matrix. If a matrix B can be found such that AB = I, then B A = I, so that B = A~' . In fact, to find 
A'\ we need only find a matrix B that satisfies one of the two conditions AB = I orBA = I. 

It is clear from Chapter 5 and our discussions in this chapter that not all n x n matrices have inverses. How do we determine whether a matrix 
has an inverse using this method? The answer is quite simple: the technique we developed to compute inverses is a matrix approach to solving 
several systems of equations simultaneously. 

Example 1222. The reader can verify that if 



( 1 



A = 



1 



-1 -2 -1 
5 8 
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then the augmented matrix 



( ' ^ 

-1 -2 

, 5 
reduces to 



1 1 ^ 

-2010 
8 1, 



(12.2.e) 



n 2 1 1 ^ 

1 1 
,058001, 

Although this matrix can be row-reduced further, it is not necessary to do so since in equation form we have: 

Xn+2X2\+ Xj,i = \ Xi2 + 2 Jr22 + ^32 = Xi3 + 2 j;23 + -^33 = 

(i) 0=1 (ii) = 1 (iii) = 

5 j;2i + 8 JTsi = 5 x^i + 8 ^^32 = 5 a:23 + 8 ^^33 = 1 

Clearly, there is no solution to Systems (i) and (ii), therefore A"' does not exist. From this discussion it should be obvious to the reader that the 
zero row of the coefficient matrix together with the nonzero entry in the fourth column of that row in matrix 12.2e tells us that A ' does not 
exist. 



EXERCISES FOR SECTION 12.2 
A Exercises 

1. In order to develop an understanding of the technique of this section, work out all the details of Example 12.2.1 . 

2. Use the method of this section to find the inverses of the following matrices whenever possible. If an inverse does not exist, explain why. 



(a) 



(-'. 



A 
-1 

2 



t 2 -1 
(c) -1 2 

. -1 

'6 7 2- 
(e) 4 2 1 

.611, 
3 . Same as question 2: 



(a) 



(c) 



(e) 



3 

!_ _ 
V 5 

( 1 -1 

-1 2 
-1 

12 3 A\ 
3 4 5 
.4 5 6 J 



(b) 



(d) 



/'O 3 2 5 ^ 
1-14 

11 

lO 1 3 -1 j 

M 2 1 ^ 

-2 -3 -1 
1 4 4 

/'2 1 3 - 
4 2 1 
8 2 4, 



(f) 



(b) 



(d) 



(f) 



10 3 
2-106 
2 10 

yo -1 3 2 ) 

no 

22-1 
,1-11 

( \ - 

2 3 

iii 
2 3 4 

iii 

V 3 4 5 

4. (a) Find the inverses of the following matrices. 

(-1 Q\ 

5 



(i) 



(2 0\ 

3 
,0 5, 



(ii) 



2 
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(b) If D is a diagonal matrix whose diagonal entries are nonzero, what is D ' ? 

5. Express each system of equations in Exercise 1, Section 12.1, in the form Ax = B. Solve each system by first finding A"' whenever 
possible. 

1 12.3 An Introduction to Vector Spaces 

When we encountered various types of matrices in Chapter 5, it became apparent that a particular kind of matrix, the diagonal matrix, was 
much easier to use in computations. For example, if A = [ ^ M, then A' can be found, but its computation is tedious. If 



then 




In a variety of applications it is beneficial to be able to diagonalize a matrix. In this section we will investigate what this means and consider a 
few appUcations. In order to understand when the diagonaUzation process can be performed, it is necessary to develop several of the underlying 
concepts of linear algebra. 

By now, you realize that mathematicians tend to generalize. Once we have found a "good thing," something that is useful, we apply it to as 
many different concepts as possible. In doing so, we frequently find that the "different concepts" are not really different but only look different. 
Four sentences in four different languages might look dissimilar, but when they are translated into a common language, they might very well 
express the exact same idea. 

Early in the development of mathematics, the concept of a vector led to a variety of applications in physics and engineering. We can certainly 
picture vectors, or "arrows," in the xj- plane and even in the three-dimensional space. Does it make sense to talk about vectors in four- 
dimensional space, in ten-dimensional space, or in any other mathematical situation? If so, what is the essence of a vector? Is it its shape or the 
rules it follows? The shape in two- or three-space is just a picture, or geometric interpretation, of a vector. The essence is the rules, or proper- 
ties, we wish vectors to follow so we can manipulate them algebraically. What follows is a definition of what is called a vector space. It is a list 
of all the essential properties of vectors, and it is the basic definition of the branch of mathematics called linear algebra. 

Definition: Vector Space. Let V be any nonempty set of objects. Define on V an operation, called addition, for any two elements 
x,yEV, and denote this operation by x + y. Let scalar multiplication be defined for a real number a e R and any element x eV and 
denote this operation by ax. The set V together with operations of addition and scalar multiplication is called a vector space over R if the 
following hold for all x, y, z eV , and a,b e R: 

(1) X + y = y + X 

(2) (x + y) + z = X + (y +z) 

(3) There exists a vector e V, such that itx+0 = x 

(4) For each vector x eV, there exists a unique vector -x e V, such that -x +xeV = 0. 

These are the main properties associated with the operation of addition. They can be summarized by saying that [V; +] is an abelian group. 
The next five properties are associated with the operation of scalar multiplication and how it relates to vector addition. 

(5) a(x + y) =ax +ay 

(6) (a + b)x = ax + bx 

(7) a{bx) = (ab)x 

(8) It = X. 

In a vector space it is common to call the elements ofV vectors and those from R scalar s. Vector spaces over the real numbers are also 
called real vector spaces. 

Example 123.1. Let V = M2x3(R) and let the operations of addition and scalar multiplication be the usual operations of addition and scalar 
multiplication on matrices. Then V together with these operations is a real vector space. The reader is strongly encouraged to verify the 
definition for this example before proceeding further (see Exercise 3 of this section). Note we can call the elements of M2x3(IR) vectors even 
though they are not arrows. 

Example 12.3.2. Let = ((ai, 02 ) | , 02 e R)- If we define addition and scalar multipUcation the natural way, that is, as we would on 

1x2 matrices, then is a vector space over R. (See Exercise 4 of this section. 

In this example, we have the "bonus" that we can illustrate the algebraic concept geometrically. In mathematics, a "geometric bonus" does not 
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always occur and is not necessary for the development or application of the concept. However, geometric illustrations are quite useful in 
helping us understand concepts and should be utilized whenever available. 

Let's consider some illustrations of the vector space . Let x= (1, 4) and y = (3, 1).\ 

We illustrate the vector (ai, 02) as a directed line segment, or "arrow," from the point (0, 0) to the point (ai, 02). The vectors x and y are as 

pictured in Figure 12.3.1 together with x + j = (1 , 4) + (3, 1) = (4, 5), which also has the geometric representation as pictured in Figure 
12.3.1. The vector 2 X = 2(1, 4) = (2, 8) is a vector in the same direction as x, but with twice its length. 




1 2 3 4 5 

Figure 12.3.1 
Addition in IR' 

Remarks: 

(1) We will henceforth drop the arrow above a vector name and use the common convention that boldface letters toward the end of the 
alphabet are vectors, while letters early in the alphabet are scalars. 

(2) The vector (ai, 02, . . . , a„) e R" is referred to as an n-tuple. 

(3) For those familiar with vector calculus, we are expressing the vector x = ai i + 027 + 03 e R^ as (01,02,03). This allows us to 
discuss vectors in R" in much simpler notation. 

In many situations a vector space V is given and we would like to describe the whole vector space by the smallest number of essential reference 
vectors. An example of this is the description of R^ , the xy plane, via the x and y axes. Again our concepts must be algebraic in nature so we 
are not restricted solely to geometric considerations. 

Definition: Linear Combination. A vector y in vector space V (over R) is a linear combination of the vectors Xj, X2, . . ., JC„ if there exist 
scalars O;, 02- ■■■•<^n ^ such that y = Uj Xj + 02 *2 + ••• +<^nX„ 

Example 1233 The vector (2, 3) in R^ is a linear combination of the vectors (1, 0) and (0, 1) since (2, 3) = 2(1, 0) + 3(0, 1). 

Example 1234. Prove that the vector (5, 4) is a linear combination of the vectors (4, 1) and (1,3). By the definition we must show that 
there exist scalars oi and 02 such that: 

(5, 4) = ai(4, 1) + 02(1, 3), 
which reduces to 

(5, 4) = (4ai + 02 , ai + 3 02), 
which gives us the system of linear equations 

4ai + 02 = 5 

oi + 3 fl2 =4 
which has solution oi = 1, 02 = 1- 

Another way of looking at the above example is if we replace ai and 02 both by 1 , then the two vectors (4,1) and (1,3) produce, or generate, 
the vector (5,4). Of course, if we replace ai and 02 by different scalars, we can generate more vectors from R^ . If a 1 =3 and 02 = -2, then 
ai(4, 1) + 02(1, 3) = 3(4, 1) +(-2) (1,3) 

= (12, 3) + (-2, -6) 

= (12 - 2, 3 - 6) = (10, -3) 
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Example 12.3.5. Will the vectors (4, 1) and (1, 3) generate any vector we choose in R-? To see if this is so, we let (bi, ^2) be an arbitrary 
vector in and see if we can always find scalars oi and 02 such that ai(4, 1) + 02(1. 3) = {b], b^). This is equivalent to solving the 
following system of equations: 

4ai + = b\ 
ai + 3 02 = ^2 

which always has solutions for and 02 regardless of the values of the real numbers b\ and bj. Why? We formalize in a definition: 

Definition: Generate. Let [Xj, X2, x„] be a set of vectors in a vector space V over R. This set is said to generate, or span, V if, for any 
given vector y eV, we can always find scalars aj, a2, a^ such that y = ajXj +a2X2 + ... + a„x„ . A set that generates a vector space is 
called a generating set. 

We now give a geometric interpretation of the above. 

We know that the standard coordinate system, x axis and y axis, were introduced in basic algebra in order to describe all points in the xy plane 
geometrically. It is also quite clear that to describe any point in the plane we need exactly two axes. Form a new coordinate system the follow- 
ing way: 

Draw the vector (4, 1) and an axis from the origin through (4, 1) and label it the x' axis. Also draw the vector (1,3) and an axis from the origin 
through (1, 3) to be labeled the y' axis. Draw the coordinate grid for the axis, that is, lines parallel, and let the unit lengths of this "new" plane 
be the lengths of the respective vectors, (4, 1) and (1 , 3), so that we obtain Figure 12.3.2. 

From Example 12.3.5 and Figure 12.3.2, we see that any vector on the plane can be described using the old (standard xy) axes or our new x'y' 
axes. Hence the position which had the name (4, 1) in reference to the standard axes has the name (1,0) with respect to the x' y' axes, or, in the 
phraseology of linear algebra, the coordinates of the point (1,3) with respect to the x ' j ' axes are ( 1 , 0) . 







/ / 
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Figure 12.3.2 



Example 12.3.6. From Example 12.3.4 we found that if we choose ai = I and 02 = 1. then the two vectors (4, 1) and (1, 3) generate the vector 
(5, 4). Another geometric interpretation of this problem is that the coordinates of the position (5, 4) with respect to the x' y' axes of Figure 
12.3.2is(l, l).In other words , a position in the plane has the name (5 , 4) in reference to the xy axes and the same position has the name ( 1 , 1 ) 
in reference to the x ' j ' axes . 

From the above, it is clear that we can use different axes to describe points or vectors in the plane. No matter what choice we use, we want to be 
able to describe each position in a unique manner. This is not the case in Figure 12.3.3. Any point in the plane could be described via the x' y' 
axes, the x' z' axes or the j' axes. Therefore, in this case, a single point would have three different names, a very confusing situation. 

We formalize the above discussion in two definitions and a theorem. 
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z 



Figure 12.3.3 

Definition: Linear Independence/Linear Dependence. The set of vectors {Xj,X2, .... Jt/,) a vector space V (over R) is linearly indepen- 
dent if the only solution to the equation ajXj +a2X2 + ■■■ -ha^x^ = is aj =02 = ... = a„ = 0. Otherwise the set is called a linearly 
dependent set. 

Definition: Basis. A set of vectors B ={Xj,X2, Xn) is a basis for a vector space V(overR)if: 

(1) B generates V, and 

(2) B is linearly independent. 

Theorem 12.3.1. If {xi, X2, .... x„} is a basis for a vector space V over R, then any vector y e V can be uniquely expressed as a linear 
combination of the Xi's. 

Proof: Assume that {xi, X2, .... x„} is a basis for V over R. We must prove two facts: 

(1) each vector y e V can be expressed as a linear combination of the x, ' s, and 

(2) each such expression is unique. 

Part (1) is trivial since a basis, by its definition, must be a generating set for V. 

The proof of (2) is a bit more difficult. We follow the standard approach for any uniqueness facts. Let y be any vector in V and assume that 
there are two different ways of expressing y, namely 

y = aiX\ + a2X2 + ... + a„Xn 

and 

y = bixi +b2X2 + ... +b„x„ 
where at least one a, is different from the corresponding bt. Then equating these two linear combinations we get 
aiXi +a2X2 + ... +a„Xn =biXi +62X2 + ... +bnX„ 

so that 

(fli - h[)X[ +(a2-h2)X2 + ... + (a„ - h„)x„ = 

Now a crucial observation: since the Xi' s form a linearly independent set, the only solution to the previous equation is that each of the coeffi- 
cients must equal zero, so a,- - ft; = for = 1, 2, ... , «. Hence a, = bi, for all /. This contradicts our assumption that at least one a; is 
different from the corresponding bi, so each vector y e V can be expressed in one and only one way. ■ 

Theorem 12.3.1 , together with the previous examples, gives us a clear insight into the meaning of linear independence, namely uniqueness. 

Example 123.7. Prove that ((1, 1), (-1, 1)) is a basis for R^ over R and explain what this means geometrically. First we must show that the 
vectors (1, 1) and (-1, 1) generate all of R^. This we can do by imitating Example 12.3.5 and leave it to the reader (see Exercise 10 of this 
section). Secondly, we must prove that the set is linearly independent. 
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Let oi and be scalars such that ai(l, 1) + 02 |-1 . 1) = (0> 0)- We must prove that the only solution to the equation is that ai and 02 must 
both equal zero. The above equation becomes (oi - 02 , fli + 02 ) = (0, 0) which gives us the system 

ai - 02 = 

a I + 02 = 

The augmented matrix of this system reduces in such way that the only solution is the trivial one of all zeros: 



(1 -1 0\ /l 0\ „ 

li 1 oj^lo 1 oj ^ «i=«2=0 

Therefore, the set is linearly independent. 



To explain the results geometrically, note through Exercise 12, part a, that the coordinates of each vector y eR^ can be determined uniquely 
using the vectors (1,1) and (-1, 1). The concept of dimension is quite obvious for those vector spaces that have an immediate geometric 
interpretation. For example, the dimension of is two and that of is three. How can we define the concept of dimension algebraically so 
that the resulting definition correlates with that of R^ and R^ ? First we need a theorem, which we will state without proof. 
Theorem 1232. If V is a vector space with a basis containing n elements, then all bases of V contain n elements. 

Definition: Dimension. Let V be a vector space over R with basis {xi, X2, x„). Then the dimension of V is n. We use the notation 
dim V = n to indicate that Vis n-dimensional 

EXERCISES FOR SECTION 12.3 
A Exercises 

1. Ifa = 2,b = -3, 

/ 1 -1 \ (2 -2 3\ / 1 

^ = (2 3 4 )' ^ = (4 5 8P"'^^ = (3 2 -2 

verify that aU properties of the definition of a vector space are true for M2x3 (R) with these values. 

2. Let a = 3, = 4, X = (-1, 3), y = (2, 3),and z = (1, 0). Verify that aU properties of the definition of a vector space are true for R^ for 

these values. 

3. (a) Verify that 7^2x3 (R) is a vector space over R. 

(b) Is M„x„ (R) a vector space over R? 

4. (a) Verify that R^ is a vector space over R. 

(b) Is R" a vector space over R for every positive integer n? 

5. Let = {ao + oix + 02 + a^x^ \ ao, fli , 02, 03 £ T^); that is, is the set of all polynomials in x having real coefficients with degree 
less than or equal to 3. Verify that P^ is a vector space over R. 

6. For each of the following, express the vector y as a linear combination of the vectors xi and X2. 

(a) y = (5, 6), Xi =(1, 0),andx2 = (0, 1) 
Qa) y = (2, 1), x, =(2, l),and X2 = (1, 1) 

(c) y = (3,4), xi = (1, l),and X2 = (-1, 1) 



(I 2\ 

7. Express the vector I 3 3 I ^ ^2x2(IR)> as a linear combination of 



1 1 w -1 5 WO n /o 

and 



V 1 1 /' V 2 1 u 1 ; vo 1 

8 . Express the vector x'^3-4x- + 3eP-'asa linear combination of the vectors l,x,x^ , and x^ . 

9. (a) Show that the set [xi , X2} generates R^ for each of the parts in Exercise 6 of this section. 

(b) Show that |jci , ^2, X3) generates R^ where Xi = (1, 1), X2 = (3,4),andX3 = (-1, 5). 

(c) Create a set of four or more vectors that generates R^ . 

(d) What is the smallest number of vectors needed to generate R^? R"? 

(e) Show that the set of matrices containing 
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generates M2x2('R) 



(f) Show that {I, X, X- , x^] generates . 

10. Complete Example 12.3.7 by showing that {(1, 1), (-1, 1)) generates IR^ 

11. (a) Prove that 1(4, 1), (1, 3)) is a basis for over R. 

(b) Prove that 1(1, 0), (3, 4)} is a basis for R^ over R. 

(c) Prove that ((1,0, -1), (2, 1, 1), (1, -3, -1)) is a basis for R^ over R. 

(d) Prove that the sets in Exercise 9, parts e and f, form bases of the respective vector spaces. 

12. (a) Determine the coordinates of the points or vectors (3, 4), (-1, l),and(l, 1) with respect to the basis {(1, 1),(-1, 1)} of R^. Interpret 
your results geometrically, 

(b) Determine the coordinates of the points or vector (3, 5, 6) with respect to the basis ((1, 0, 0), (0, 1, 0), (0, 0, 1)). Explain why this 
basis is called the standard basis for R^ . 

13. (a)Letj'i= (1,3, 5, 9), y2 = (5,7, 6, 3),andc = 2. Find +3^2 and c^-i. 

(b) Let Mx) = I + 3x + 5x^ + 9x^ ,f2(.x) = 5 + 7 x + 6x^ + 3x^ andc = 2.Find/i(;i:) +/2(A:)andc/i(.x). 

(c) LetA=^^ ^j,B = ^^ ^ j,andc = 2 .FindA + BandcA. 

(d) Are the vector spaces R"* , and M2x2(IR) isomorphic to each other? Discuss with reference to parts a, b, and c. 
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1 12.4 The Diagonal ization Process 



We now have the background to understand the main ideas behind the diagonaUzation process. 

Definition: Eigenvalue, Eigenvector. Let A be an nxn matrix over R. A is an eigenvalue of A if for some nonzero column vector 
X e 1^ we have Ax = Ax. x is called an Eigenvectors corresponding to the eigenvalue X. 

Example 124.1. Find the eigenvalues and corresponding eigenvectors of the matrix = ^ 2 3 )' ^® '^^1- to find nonzero vectors = ^ j 



and real numbers A such that 
AX = XX 



[lit 

(lllD-itHl) 
[\\)[ 



2 1 
2 3 



1 j U2 

1 



I 



-('"3-j(:)=(2) <-> 

The last matrix equation wiU have nonzero solutions if and only if 



det 



/'2-A 1 \ „ 
( 2 3-a)=° 



2 3- 

or (2 - A) (3 - A) - 2 = 0, which simplifies toA^-5A + 4 = 0. Therefore, the solutions to this quadratic equation, Ai = 1 and A2 = 4, 
are the eigenvalues of A. We now have to find eigenvectors associated with each eigenvalue. 

Case l.For A] = 1, Equation 12.4a becomes: 



'2-1 1 
2 3- 

,2 2)(x2)"(o 

which reduces to the single equation, x\+ X2 = 0. From this, x\ = -X2. This means the solution set of this equation is (in column notation) 



So any column vector of the form ^ where c is any nonzero real number is an eigenvector associated with Ai = 1 . The reader should 
verify that, for example, 



so that 



'2 1 
>2 3 

/ 2 \ 

3 

2 

V '3 / 



I I \ 

3 

2 

3 J 



( I \ 

3 

2 

V i ) 



is an eigenvector associated with eigenvalue 1. 



Case 2. For A2 = 4 equation 12.4 .a becomes: 



^2-4 1 \lxi\(0 
I 2 ?,~a]\x2} lo 

(2 -1 )(x2)"(o ) 

which reduces to the single equation -2 xi + X2 = 0, so that X2= 2 xi . The solution set of the equation is 



E2 



-i 



-2 c 
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Therefore, all eigenvectors of A associated with the eigenvalue X2 = 4 are of the form i ^ ) , where c can be any nonzero number 



c 

I -2c 

The following theorems summarize the more important aspects of this example: 

Theorem 12.4.1. Let A be any n xn matrix over R. Then Ae Ris an eigenvalue of A if and only ifdet{A - AT) = 0. 

The equation det(A - A/) = is called the characteristic equation and the left side of this equation is called the characteristic polynomial of 
A. 

Theorem 12.4.2. Nonzero eigenvectors corresponding to distinct eigenvalues are linearly independent. 

The solution space of (A - A/) x = is called the eigenspace of A corresponding to A. This terminology is justified by Exercise 2 of this section. 

We now consider the main aim of this section. Given an n xn (square) matrix A, we would like to "change" A into a diagonal matrix D, perform 
our tasks with the simpler matrix D, and then describe the results in terms of the given matrix A. 

Definition: Diagonalizable Matrix. An nxn matrix A is called diagonalizable if there exists an invertible nxn matrix P such that 
P~' AP is a diagonal matrix D. The matrix P is said to diagonalize the matrix A. 

Example 12^.2. We wiU now diagonalize the matrix A of Example 12.4.1. Form the matrix P as follows: Let P'^' be the first column of P. 
Choose for P<'' any eigenvector from Ei. We may as well choose a simple vector in Ei so P*'* = ^ \ ) '^^^ candidate. Similarly, let P*^' be 
the second 

column of P, and choose for P<^' any eigenvector from E2. The vector P*^' ~ ( 2 ) reasonable choice, thus 



2 
3 
i_ 

V 3 



1_ \ 

3 
1_ 
3 



So that 



P-MP = 



2 -1 
1 1 



2 1 
2 3 



1 1 
-1 2 



1 
4 



Notice that the elements on the main diagonal of D are the eigenvalues of A, where Da is the eigenvalue corresponding to the eigenvector P''' . 
Remarks: 

(1) The first step in the diagonalization process is the determination of the eigenvalues. The ordering of the eigenvalues is purely arbitrary. If 

M 

we designate Ai = 4 and A2 = 1 , the columns of P would be interchanged and D would be I ^ ^ 
less, the final outcome of the application to which we are applying the diagonalization process would be the same. 

(2) IfAisannXn matrix with distinct eigenvalues , then P is also an n x n matrix whose columns P* ' ' , P*^' , . . . , P*"' are n linearly independent 
vectors. 

Example 12^3. Diagonalize the matrix 



(see Exercise 3b of this section). Nonethe- 



A = 



n 12 -18 ^ 

-11 18 
,0-6 10 

n-x 

det(A-A/) = det 
. 

= (1 -A)det( 



12 
-A- 11 
-6 

-A - 11 



-18 
18 
10- A j 

18 



-6 10 - A 
= (1 - A) ((-A - 11) (10 - A) + 108) 
= (1 -A)(A2 + A-2) 

Hence, the equation det(A - A/) = becomes 

(1 - A) (A^ + A - 2) = - (A - 1)2 (A + 2) 

Therefore, our eigenvalues for A are Ai = -2 and A2 = 1. We note that we do not have three distinct eigenvalues, but we proceed as in the 
previous example. 

Case 1 . For Ai = -2 the equation (A-XI)x= becomes 
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1 3 12 -18 ^ 
-9 18 
.0-6 12 , 



( xi 



10 \ 


.0 . 



Using Mathematica, we can row reduce the matrix: 



/3 12 -18 



RowReduce 



0-9 18 
^0 -6 12 } 

n 2 \ 

1 -2 
.0 0, 

In equation form, the matrix equation is then equivalent to 
xi = -2xi 

X2- 2 X3 

Therefore, the solution, or eigenspace, corresponding to Xi = -2 consists of vectors of the form 



( -2x3) 




(-2\ 


2x^ 


= X-i 


2 


. X3 . 




. 1 . 



Therefore 



-2\ 
2 

\ 1 . 



is an eigenvector corresponding to the eigenvalue Ai = -2, and can be used for our first column of P: 



p(i) = 



(-2 
2 

. 1 ; 



Before we continue we make the observation: E2 is a subspace of with basis {P''*) and dim £1 = 1. 
Case 2. If A2 = 1 , then the equation (A -\I)x= becomes 

I'D 12 -18 
-12 18 
.0-69 

Without the aid of any computer technology, it should be clear that all three equations that correspond to this matrix equation are equivalent to 

3 

2 - 3 j;3 = 0, or X2 = - X3 . Notice that x\ can take on any value, so any vector of the form 





(X\ \ 




ro ^ 




Xl 









, XT, , 




,0 , 







{ ' ^ 




{ ^ \ 











3 






+ X3 












2 


. x^ , 




,0 J 




I 1 ] 



will solve the matrix equation. 

We note that the solution set contains two independent variables, xi and X3. Further, note that we cannot express the eigenspace E2 as a linear 
combination of a single vector as in Case 1 . However, it can be written as 



£2 = {^1 



{ ^ ^ 




f ^ \ 







3 


+ X3 


2 


.0 , 




. 1 J 



We can replace any vector in a basis is with a nonzero multiple of that vector. Simply for aesthetic reasons, we wiU multiply the second vector 



that generates £2 by 2. Therefore, the eigenspace £2 is a subspace of with basis I , 3 | and so dim £2 = 2. 

What this means with respect to the diagonaUzation process is that A2 = 1 gives us both Column 2 and Column 3 the diagonaUzing matrix. 
The order is not important. Let 





f ^ 1 




ro ^ 




(-2 1 ^ 


p(2) = 





and P<3' = 


3 


and so P = 


2 3 




.0 , 




.2 , 




.10 2, 
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The reader can verify (see Exercise 5 of this section) that 



P-1 = 



1 

.0 



-3 ^ 
-6 



-1 2 



and P-^AP 



^ 

1 
1 



In doing Example 12.4.3, the given 3x3 matrix A produced only two, not three, distinct eigenvalues, yet we were still able to diagonalize A. 
The reason we were able to do so was because we were able to find three linearly independent eigenvectors. Again, the main idea is to produce 
a matrix P that does the diagonalizing. If A is an n x n matrix, P will be an nxn matrix, and its n columns must be linearly independent 
eigenvectors. The main question in the study of diagonalizability is "When can it be done?" This is summarized in the following theorem. 

Theorem 12.43. Let Abe an n x n matrix. Then A is diagonalizable if and only if A has n linearly independent eigenvectors. 

Outline of a proof: («=) Assume that A has linearly independent eigenvectors , P'^' , P'^' , . . . , P*"' , with corresponding eigenvalues Ai , A2 , . . . , A„ . 
We want to prove that A is diagonalizable. Column i of the n xn matrix AP is AP*'' (see Exercise 7 of this section). Then, since the P*'' is an 
eigenvector of A associated with the eigenvalue A, we have AP'"' = AjP*'* for; = 1, 2, .... n. But this means that AP = PD, where D is 
the diagonal matrix with diagonal entries Ai , A2, . . . , A„. If we multiply both sides of the equation by P~' we get the desired P~' AP = D. 

(==>) The proof in this direction involves a concept that is not covered in this text (rank of a matrix); so we refer the interested reader to 
virtually any linear algebra text for a proof. ■ 

We now give an example of a matrix which is not diagonalizable. 



Example 12AA. Let us attempt to diagonalize the matrix A 



1 1 





^ 

2 1 
-1 4, 



1 

2 

,1 -1 





1 

4 ) 



I 1 

.1 



- 
2 1 
-1 4 , 



/ 1 -A 



det(A -XI)= det 



-A ) 



det(A -XI) : 



( 
2-A 

-1 4- 

= (l-A)det(^Jj'^ 

= (1 -A)((2-A)(4-A) + l) 
= (1 - A)(A2 -6A +9) 
= (l-A)(A-3)2 
^ A=lorA = 3 



Therefore there are two eigenvalues, Ai = 1 and A2 = 3. Since Aj is an eigenvalue of degree it will have an eigenspace of dimension 1. Since 
A2 is a double root of the characteristic equation, the dimension of its eigenspace must be 2 in order to be able to diagonalize. 

Case 1 . For Ai = 1 , the equation (A - A /) ;c = becomes 



AO 





^ 




' xi - 




(0 \ 





1 


1 











. 1 


-1 


3 , 








.0 . 



A quick Mathematica evaluation make the solution to this system obvious 
RowReduce [A - identltyMatrlx [3] ] 

n Q A\ 

1 1 

>o 0, 

There is one free variable, X'i, and 
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' -4 X3 ' 




-_4- 








= X3 


-1 


.^3 , 




. ^3 , 




, 1 , 



Hence, 



f-4\ 



y 1 ) 



is a basis for the eigenspace of Ai = 1 . 



Case 2. ForA2= 3, the equation (A - A /) j: = becomes 




I 1 



-1 1 





' xi - 




/O \ 













.^3 , 




.0 , 



RowReduce [A - 3 IdentityMatrix [3 ] ] 



/ 1 





\ 





1 


-1 


,0 





, 



Once again there is only one free variable in the row reduction and so the dimension of the eigenspace will be one: 



'Xl ) 




( ^ 




I'D ■ 


X2 




X3 


= X-i 


1 


> Xj . 




.Xi . 




. 1 , 



Hence, 



/'O ^ 
1 

1 ) 



I is a basis for the eigenspace of A2 = 3. This means that At = 3 produces only one column for P. Since we began with only two 



eigenvalues, we had hoped that one of them would produce a vector space of dimension two, or, in matrix terms, two linearly independent 
columns of P. Since A does not have three linearly independent eigenvectors A cannot be diagonalized. 



Mathematica Note 



Diagonalization can be easily done with a few built-in functions of Mathematica. Here is a 3 x 3 matrix we've selected because the eigenval- 
ues are very simple, and could be found by hand with a little work. 

4 10 
15 1 

10 14; 

The set of linearly independent eigenvectors of A can be computed: 
Eigenvectors [A] 

f I 2 I \ 
-1 1 
,1 -11, 

The rows of this matrix are the eigenvectors, so we transpose the result to get our diagonalizing matrix P whose columns are eigenvectors. 
P = Transpose [Eigenvectors [A] ] 

/ 1 -1 I \ 
2 0-1 
.11 1 , 

We then use P to diagonalize. The entries in the diagonal matrix are the eigenvalues of A. 
Inverse [P] . A. P 



(6 





\ 





4 





,0 





3 , 



We could have gotten the eigenvalues directly this way: 
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Eigenvalues [A] 

(6,4,31 

Most matrices that are selected at random will not have "nice" eigenvalues. Here is a new matrix A that looks similar to the one above. 



A = 



8 10 
15 1 

\0 1 1 I 



Asking for the eigenvalues first, we see that the result is returned symbolically as the three roots to a cubic equation. The default for Mathemat- 
ica is to leave these non-computed. Since the entries of A are exact numbers, Mathematica is capable of giving an exact solution, but it's very 
messy. The easiest way around the problem is to make the entries in A approximate. The following expression redefines A as approximate. 

A = N[A] 

8. 1. 0. \ 
1. 5. 1. 
\Q. 1. 7., 

Now we can get approximate eigenvalues, and the approximations are very good for most purposes. 
Eigenvalues [A] 
(8.3772, 7.27389,4.34891) 

We can verify that the matrix can be diagonalized although due to round-off error some of the off-diagonal entries of the "diagonal" matrix are 
nonzero. 

P = Transpose [Eigenvectors [A] ] 

('0.906362 -0.341882 0.248244 ^ 
0.341882 0.248244 -0.906362 
,0.248244 0.906362 0.341882 , 

Inverse [P] . A. P 



8.3772 
0. 

1, 1.66533 xlO"'^^ 



2.22045x10-"^ 6.66134x10-'* ^ 
7.27389 4.44089x10-"' 



.44089x10- 



4.34891 



The Chop function will set small numbers to zero. The default thresh hold for "small" is 10 but that can be adjusted, if desired. 
Diag = Chop [Inverse [P] .A.P] 



('8.3772 ^ 

7.27389 
^ 4.34891 , 

We can't use the name D here because Mathematica reserves it for the differentiation function. 

If you experiment with more matrices, you will undoubtedly encounter situations where some eigenvalues are complex. The process is the 
same, although we've avoided these just for simplicity. 

pTi^ Sage Note 



We start by defining the same matrix as we did in Mathematica. We also declare D and P to be variables. 



A = Matrix (QQ, [ [4, 1, 0] 


[1, 5, 1], [0, 1, 4]] 


) ;A 


[4 1 0] 






[1 5 1] 






[0 1 4] 






var ( ' D, P ' ) 






(D- P) 







We have been working with "right eigenvectors" since the x in Ax = Ax is a column vector to the right of A. It's not so common but still 
desirable in some situations to consider "left eigenvectors," so Sage allows either one. The right_eigenmatrix method returns a pair of 
matrices. The diagonal matrix, D, with eigenvalues and the diagonalizing matrix, P, which is made up of columns that are eigenvectors 
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corresponding to the eigenvectors of D. 



(D,P)=A.right_eigenmatrix( ) ; (D,P) 



( 

[6 0] [1 1 1] 
[0 4 0] [2 -1] 
[0 3], [ 1 -1 1] 
) 

We should note here that P is not unique because even if an eigenspace has dimension one, any nonzero vector in that space will serve as an 
eigenvector. For that reason, the P generated by Sage isn't identical to the one generated by Mathematica, but they both work. Here we verify 
the result for our Sage calculation. Recall that an asterisk is used for matrix multiplication in Sage. 

P. inverse ( ) *A*P 
[6 0] 
[0 4 0] 
[ 3] 



Here is a second matrix, again the same as we used with Mathematica. 



A2=Matrix(QQ, [[8, 1,0], [1,5,1], [0-1-7]]); A2 

[8 1 0] 
[1 5 1] 
[0 1 7] 

Here we've already specified that the underlying system is the rational numbers. Since the eigenvalues are not rational. Sage will revert to 
approximate number by default. We'll just pull out the matrix of eigenvectors this time and display rounded entries. Here the diagonalizing 
matrix looks very different from the result from Mathematica, but this is because he eigenvalues are not in the same order in the two calcula- 
tions. They both diagonalize but with a different diagonal matrix. 



P=A2 . right_eigenmatrix ( ) [ 1 ] 
P . numer ical_approx ( digits=3 ) 

[ 1.00 1.00 1.00] 
[ -3.65 -0.726 0.377] 
[ 1.38 -2.65 0.274] 
D= ( P . inverse ( ) *A2 *P ) ; D . numerical_approx ( digits=3 ) 
[ 4.35 0.000 0.000] 
[0.000 7.27 0.000] 
[0.000 0.000 8.38] 



EXERCISES FOR SECTION 12.4 
A Exercises 



2 1 



1. (a) List three different eigenvectors of A = ^ 3 j' '^^ matrix of Example 12.4.1, associated with the two eigenvalues 1 and 4. Verify your 
results. 

((b) Choose one of the three eigenvectors corresponding to 1 and one of the three eigenvectors corresponding to 4, and show that the two 
chosen vectors are linearly independent. 

2. (a) Verify that Ei and E2 in Example 12.4.1 are vector spaces over IR. Since they are also subsets of IR-, they are called subvector-spaces, or 
subspaces for short, of IR-. Since these are subspaces consisting of eigenvectors, they are called eigenspaces. 

(b) Use the definition of dimension in the previous section to find dim Ei and dim £2 • Note that dim Ei+ dim E2 = dim . This is not a 
coincidence. 

3. (a) Verify that P"' A P is indeed equal ^ q 4 j' indicated in Example 12.4.2. 

(b) Choose P<" = Q j and ~ ( \ ) ^"'^ verify that the new value of P satisfies p-' AP = ^^ ^ j 

(c) Take any two linearly independent eigenvectors of the matrix A of Example 1 2 .4 .2 and verify that ' A P is a diagonal matrix . 
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1 1 

.10 2, 



4. (a) Let A be the matrix in Example 12.4.3 and P = 
P-^AP 

(b) If you choose the columns of P in the reverse order, what is P"' A P? 

5. Diagonalize the following, if possible: 



Without doing any actual matrix multiplications, determine the value of 



(a) 


' I 2\ 
y3 2) 


(b) 




(c) 


(»:) 




l' 1 -1 4 ^ 




(6 ^ 




f I -1 \ 


(d) 


3 2-1 


(e) 


7-4 


(f) 


-1 2 -1 




.2 1 -1, 




.913, 




. -1 1 , 



6. Diagonalize the following, if possible: 



(a) 



(d) 











1) 




f 1 


3 


6 ^ 


-3 


-5 


-6 


. 3 


3 


6 , 



(b) 



(e) 



2 1 
4 2 

M 1 'I 
1 1 

.011, 



(c) 



(f) 



f 2 -1 \ 

-1 2 -1 
. -1 2 , 



B Exercise 

7. Let A and P be as in Example 12. 4 .3. Show that the columns of the matrix A P can be found by computing A P*'', A P*^', ...,AP*"'. 

8. Prove that if P is an nxn matrix and D is a diagonal matrix with diagonal entries di, d2, d„, then PDis the matrix obtained fi-om P, but 
multiplying column/ of P by d,,! = 1, 2 n . 

C Exercise 

9. (a) There is an option to the Mathematica functions Eigenvectors and Eigenvalues called Cublcs that will use the cubic equation 

(S 1 0\ 



to find exact eigenvalues of a matrix like 



. Use that option to find the exact eigenvalues of the matrix. Diagonalize the matrix using 



1 5 1 

10 1 7 j 

the Cublcs option and then convert the result to a matrix of approximate numbers to compare your result with the approximate result we 
found in the Mathematica Note. 



1 12.5 Some Applications 



A large and varied number of applications involve computations of powers of matrices. These applications can be foimd in science, the social 
sciences, economics, the analysis of relationships with groups, engineering, and, indeed, any area where mathematics is used and, therefore, 
where programs are to be developed. We will consider a few diverse examples here. 

To aid your understanding of the following examples, we develop a helpful technique to compute A™, m > 1 . If A can be diagonalized, then 
there is a matrix P such that P~' A P = D, where D is a diagonal matrix and 

A" = PD"P-1 foraUm> 1. (12.5 a) 

You are asked to prove this equation in Exercise 9 of Section 5.4. The condition that £) be a diagonal matrix is not necessary but when it is, the 
calculation on the right side is particularly easy to perform. Although the formal proof of equation 12.4a is done by induction, the reason why it 
is true is easily seen by writing out an example such as m = 3: 

A" = (P D P-^)" To get this , solve P ' A P = D for A and substitute 
= (PDP-^)(PDP-^)(PDP-^) 

= PD(P-^ P)D(P-^ P)DP-^ by associativity of matrix mult. 
= PDIDIDP-^ 
= PDDDP-^ 
= PD^P-^ 

Example 12S.1: Recursion. Consider the computation of terms of the Fibonacci sequence, which we examined in Example 8.1.5: 

Fo= 1, Pi = 1 



Fk-i+Fk-2 forfe>2. 
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In order to formulate the calculation in matrix form, we introduced the "dummy equation" F),_^ = Fi^_^ so that now we have two equations 



Pk-\ - Pk-i 

These two equations can be expressed in matrix form as 



1 1 
1 



Fk-i 



A^ii'-^ I if ^:>3 



iik>2 

1 1 
1 



etc . if is large enough 
We can use induction to prove that if ^: > 2, 



Next, by diagonalizing A and using the fact that A™ = P D'" P ' . we can show that 







k 




k\ 


V 


2 

\ J 






^ J 


/ 



See Exercise la of this section. 
Comments: 

(1) An equation of the form Fj. = aFj-_i + bFj^_2 , where a and b are given constants, is referred to linear homogeneous second-order 
difference equation. The conditions Fq = cq and Fi = ci , where ci and ci are constants, are called initial conditions. Those of you who are 
familiar with differential equations may recognize that the this language parallels what is used in differential equations. Difference (AKA 
recurrence) equations move forward discretely— that is, in a finite number of positive steps— while a differential equation moves continu- 
ously—that is, takes an infinite number of infinitesimal steps. 

(2) A recurrence relationship of the form F^- = a Fj._i + b, where a and b are constants, is called a first-order difference equation. In order to 
write out the sequence, we need to know one initial condition. Equations of this type can be solved similarly to the method outlined in Example 
12.5.1 by introducing the superfluous equation 1=0 Fj-_i -f 1 to obtain in matrix equation: 



a b 
1 



Fk-i 
1 



a b 
1 



Example 12.5.2: Graph Theory. Consider the graph in Figure 12.5.1. 




Figure 12.5.1 

From the procedures outlined in Section 6.4, the adjacency matrix of this graph is 

n I 
A= 1 1 
.0 1 1 j 

Recall that A'' is the adjacency matrix of the relation , where r is the relation {(a, a), (a, b), (b, a), (b, c), (c, b), (c, c)) of the above 
graph. Also recall that in computing A*, we used Boolean arithmetic. What happens if we use "regular" arithmetic? For example, 
rl I I ■\ 
A = 12 1 
.112, 

How can we interpret this? We note that A33 = 2 and that there are two paths of length two from c (the third node) to c. Also, An = 1, and 
there is one path of length 2 from a to c . The reader should verify these claims from the graph in Figure 1 2 .5 . 1 . 

Theorem 12.5.1. The entry (A*).^. is the number of paths, or walks, of length k from node v,, to node vj . 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-ShareAlikes 3.0 United States License. 



Chapter 12 - More Matrix Algebra 



How do we find A* for possibly large values of kl From the discussion at the beginning of this section, we know that A'' = PL^ P ' if A is 
diagonalizable. We leave to the reader to show that A = 1 , 2, and - 1 are eigenvalues of A with eigenvectors 



respectively, so that 



( ^ ] 




f ^ 1 




f ^ 1 







1 


, and 


-2 


.-1, 




,1 . 




. 1 . 



Cl 



A*= P 







2* 
[0 (-l)M 



where P ■■ 



( I I I \ 
1 -2 
-1 1 1 J 



and P-' 



2 



3 6 y 



See Exercise 5 of this section for the completion of this example. 

Example 12.53: Matrix Calculus. Those who have studied calculus recall that the Maclaurin series is a useful way of expressing many 
common functions. For example, 

= y — 

k=0 

Indeed, calculators and computers use these series for calculations. Given a polynomial f(x), we defined the matrix-polynomial / (A) for square 
matrices in Chapter 5 . Hence, we are in a position to describe c'* for an n x n matrix A as a limit of polynomial. Formally, we write 

Again we encounter the need to compute high powers of a matrix. Let A be an nxn diagonalizable matrix. Then there exists an invertible nxn 
matrix P such that P'' A P = D,a diagonal matrix, so that 

~ (PDP-'f 
k=0 



kl 



The infinite sum in the middle of this final expression can be easily evaluated if D is diagonal. All entries of powers off the diagonal are zero 
and the ;* entry of the diagonal is 



k=0 



For example, '^'^ = ^2 3)'*^* ^^^^ matrix we diagonaUzed in Section 12.3, we found that ^ = ^ 2 ) ^ ~ ( 4 ) ' 







f- -M 

3 3 

- - 

k 3 3 / 




( 2e e' 


e e* \ 




T + T 


3 3 




2e 2e* 
3 3 


f 2 e* 

- -\ 

3 3 / 



/ 20.0116 17.2933 
i 34.5866 37.3049 
Comments on Example 12.5.3: 

(1) Many of the ideas of calculus can be developed using matrices. For example, if 

,3 ■Xt2 . 



A{t) 



[e' 2 I 
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then 



dm^nf 6/ + 8 ^ 

[ e' > 



(2) Many of the basic formulas in calculus are true in matrix calculus. For example, 



d(A(t)+B(t)) _ dA(t) clB(t) 
(It dt dt 



and if A is a constant matrix, 

Ac'*' 



dt 



(3) Matrix calculus can be used to solve systems of differential equations in a similar manner to the procedure used in ordinary differential 
equations . 

Mathematica Note 

Mathematica'i matrix exponential function is MatrixExp. 
MatrixExp[(^ 



pFi^ Sage Note 

Sage's matrix exponential method is called exp. 



A=Matrix(QQ,[[2,l],[2,3]]); 
A. exp( ) 

[ 2/3*e + l/3*e'4 -l/3*e + l/3*e'4] 
[-2/3*e + 2/3*e"4 l/3*e + 2/3*e"4] 



EXERCISES FOR SECTION 12.5 
A Exercises 

1. (a) Write out all the details of Example 12.5.1 to show that the formula for F^. given in the text is correct, 
(b) Use induction to prove the assertion made in Example 12.5.1 that 




2. (a) Do Example 8.3.8 of Chapter 8 using the method outlined in Example 12.5.1. Note that the terminology characteristic equation, 
characteristic polynomial, and so on, introduced in Chapter 8, comes from the language of matrix algebra, 

(b) What is the significance of Algorithm 8.3.1 , part c, with respect to this section? 

3. Solve S(k) = 5 S(k - 1) + 4, with 5(0) = 0, using the method of this section. 

4. How many paths are there of length 6 between vertex 1 and vertex 3 in Figure 12.5.2? How many paths from vertex 2 to vertex 2 of length 
6 are there? Hint: The characteristic polynomial of the adjacency matrix is X*. 
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1 


3 


1 


4 


/ \ 
\- 




l\ 
\l 


2 




1 









Figure 12.5.2 

5. Use the matrix A of Example 12.5.2 to: 

(a) Determine the number of paths of length 1 that exist from vertex a to each of the vertices in Example 12.5.2. Verify using the graph. 
Do the same for vertices h and c. 

(b) Verify all the details of Example 12.5.2. 

(c) Use Example 12.5.2 to determine the number of paths of length 4 there are from each node in the graph of Figure 12.5.1 to every 
node in the graph. Verify your results using the graph. 



6. Let A = 
(a) Find c** 



2 -1 
-1 2 



(-1)'/ 



(b) Recall that sin x = 2 (2k+i)'. ^""^ compute sin A 



(d) Formulate a reasonable definition of the natural logarithm of a matrix and compute In A. 



1 1 



7. We noted in Chapter 5 that since matrix algebra is not commutative under multiplication, certain difficulties arise. Let A = I ^ ^ | and 



B. 



'0 
,0 2 



(a) Compute e'^ , , and e^^^ . Compare c^c^ , f^e'* and e''+* . 

(b) Show that if is the 2 x 2 zero matrix, then = 1. 

(c) Prove that if A and B are two matrices that do commute, then e^'^^ = , thereby proving that and commute. 

(d) Prove that for any matrix A, (e^Y^ = e"'*. 

8. Another observation for adjacency matrices: For the matrix in Example 12.5.2, note that the sum of the elements in the row corresponding 
to the node a (that is, the first row) gives the outdegree of a. Similarly, the sum of the elements in any given column gives the indegree of the 
node corresponding to that column. 




Figure 12.5.3 

(a) Using the matrix A of Example 12.5.2, find the outdegree and the indegree of each node. Verify by the graph. 

(b) Repeat part (a) for the directed graphs in Figure 12.5.3. 
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I SUPPLEMENTARY EXERCISES FOR CHAPTER 12 



Section 12.1 

1 . Find all solutions of the following systems: 



(a) 2j;i - 2j;2 + = 

Xi + X2 + X-j 

2. Find all solutions of 



1 

: 
3 



(b) xi - xj = 

2xi - 4x2 =1 

-Xi + X2 - JTa = -1 



Xi ~ X2 + 2 X3 

3xi + X3 

2xi + X2 - X3 



Section 12.2 

3. Determine using the method of the text if 







f 1 


2 


1 




A 




-2 


-3 




1 






, 1 


4 


4 




Find the inverse of the matrix 










ro 


-4 


1 


■ 






1 


















1 












.0 








4, 





Section 12.3 

5. In this exercise, write elements of in column form. Let {xi, X2) be a basis in R^. Prove that {Axi, Ax2 } is a basis for if and only if A 
has an inverse. 

6. Let V = (/' : X -» R), where X is any nonempty set. Show that V is a vector space under the operations: 

(/+ g)ix) = fix) + gix) for/, ^ e V,andx e X 
(cf)ix) = cf(x)ioxf e V,c e R,and;c e X. 

7. (a) Convince yourself that M2x3(Z2) is a vector space over Z2 (i.e., allow only scalars from Z2 and use mod 2 arithmetic). 

(b) What is the vector -X, for any X e M2y.iiI.2V- 

(c) What is |M2x3(Z2)|? 

8. (a) Define operations on R so that R is a vector space over R. 

(b) What is a basis for the vector space part a? What is its dimension? 



Section 12.4 

r 0.6 0.2 1 

9. Employ the diagonalization process to approximate the 100 power of A, where ^^[q^ osi' 

fO ~ 





10. LetB = 



5 
3 

-6 ) 



and C: 



M 2 2 
2 4 2 
.2 2 4 J 



\0 6 

(a) Find aU of the eigenvalues of B. 

(b) Given that 2 and 8 are the only eigenvalues of C, find invertible matrix P and diagonal matrix D such that C = PDP'^ 
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(0 1 


\ 




M 





^ 


11. Let A = 


1 


1 


and B = 





4 


1 




.0 1 


, 




.0 





2 , 



(a) Find aU of the eigenvalues of A. 

(b) Given that 4 and 2 are the only eigenvalues of B, find invertible matrix P and diagonal matrix D such that B = PDP~^ . 
12. Find all eigenvalues and associated eigenvectors of the matrix A, and write A in the form A = PDP~^ . 

-3 1 



r 1 

Lo 2 J 



Section 12.5 

13. For a multigraph we can define its matrix representation as follows: Ajj = the number of different edges e from vertex a; to vertex Uj. 

(a) Draw the digraph that is described by the following matrix: 

f 2 1 ^ 

A= 1 3 
,110 

(b) Determine A^ and interpret the result using Theorem 12.5.1 . 
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Chapter 13 




BOOLEAN ALGEBRA 




George Boole, 1815 - 1864 



George Boole 

George Boole wasn't idle a lot. 

He churned out ideas on the spot, 

Making marvellous use of 

Inclusive/exclusive 

Expressions like AND, OR, and NOT 

- limerick by Andrew Robinson from the Omnificent English Dictionary In Limerick Form 

GOALS 

In this chapater we will develop an algebra that is particularly important to computer scientists, as it is the mathematical foundation of computer 
design, or switching theory. The similarities of Boolean algebra and the algebra of sets and logic will be discussed, and we will discover special 
properties of finite Boolean algebras. 

In order to achieve these goals, we will recall the basic ideas of posets introduced in Chapter 6 and develop the concept of a lattice, which has 
applications in finite-state machines. The reader should view the development of the topics of this chapter as another example of an algebraic 
system. Hence, we expect to define first the elements in the system, next the operations on the elements, and then the common properties of the 
operations in the system. 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-ShareAlike 3.0 United States License. 



Chapter 13 - Boolean Algebra 

1 13.1 Posets Revisited 

From Chapter 6, Section 3, we recall the following definition: 

Definition: Poset. A set L on which a partial ordering relation (reflexive, antisymmetric, and transitive) r is deflned is called a partially 

ordered set, or poset, for short. 

We recall a few examples of posets: 

(1) L = R and r is the relation <. 

(2) L = PiA) where A = {a, b] and r is the relation c. 

(3) L = (1, 2, 3, 6) and r is the relation I (divides). We remind the reader that the pair (a, b) as an element of the relation r can be 
expressed as (a, h) er,oi arh, depending on convenience and readability. 

The posets we will concentrate on in this chapter will be those which have maxima and minima. These partial orderings resemble that of < on 
R, so the symbol < is used to replace the symbol r in the definition of a partially ordered set. Hence, the definition of a poset becomes: 

Definition: Poset. A set on which a partial ordering, < , is defined is called a partially ordered set, or, in brief, a poset. Here, < is a 
partial ordering on L if and only if for all a, b, c £ L: 

(1) a < a(reflexivity) , 

(2) a < h and b < a ^ a = b {antisymmetry) , and 

We now proceed to introduce maximum and minimum concepts. To do this, we will first define these concepts for two elements of the poset 
L, and then define the concepts over the whole poset L. 

Definition: Lower Bound, Upper Bound. Let a, b e L,a poset. Then c e Lis a lower bound of a and bifc < a and c < b.d e Lis 
an upper bound of a and b if a < d and b < d. 

Definition: Greatest Lower Bound. Let L be a poset and < be the partial ordering on L. Let a, b e L, then g e L is a greatest 
lower bound of a and b, denoted glb(a, b), if and only if 

• g ^ a, 

• g £ b, and 

• ifg ' E L such that ifg ' < a and g' < b, then g' < g. 

The last condition says, in other words, that if g ' is also a lower bound, then g is "greater" than g ', so g is a greatest lower bound. 

The definition of a least upper bound is a mirror image of a greatest lower bound: 

Definition: Least Upper Bound. Let Lbe a poset and < be the partial ordering on L. Let a, b e L, then f e Lisa least upper bound 
of a and b, denoted lubia, b), if and only if 

• a < f , 

• b < f, and 

• iff E Lsuchthatifa </" and b </", then /' < f\ 

Notice that the two definitions above refer to "...a greatest lower bound" and "a least upper bound." Any time you define an object like these 
you need to have an open mind as to whether more than one such object can exist. In fact, we now can prove that there can't be two greatest 
lower bounds or two least upper bounds. 

Theorem 13.1.1. Let L he a poset and < he the partial ordering on L, and a, b e L. If a greatest lower bound of a and b exists, then it 
is unique. The same is true of a least upper bound, if it exists. 

Proof: Let g and g ' be greatest lower bounds of a and b . We wiU prove that g = g'. 

(1) g a greatest lower bound of a and i => g is a lower bound of a and fo. 

(2) g' a greatest lower bound of a and b and g a lower bound of a and b g < g ' by the definition of greatest lower bound. 

(3) g'agreatestlowerboundof aand^ g 'is a lower bound of a and 

(4) g a greatest lower bound of a and b and g' a lower bound of a and b => g ' < g by the definition of greatest lower bound. 

(5) g £ g ' and g'<g => g = g' by the antisymmetry property of a partial ordering. 

The proof of the second statement in the theorem is almost identical to the first and is left to the reader. ■ 

Definition: Greatest Element, Least Element. Let L be a poset. M e L is called the greatest (maximum) element of L if , for all 

a E L, a < M . In addition, m E L is called the least (minimum) element of L if for all a E L, m < a. 

Note: The greatest and least elements, when they exist, are frequently denoted by 1 and respectively. 
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Example 13.1.1. Let L = {1, 3, 5, 7, 15, 21, 35, 105} and let < be the relation I (divides) on L. Then L is a poset. To determine the 
lub of 3 and 7, we look for all f e L, such that 3 | / and 7 | /. Certainly, both / = 21 and / = 105 satisfy these conditions and no other 
element of L does. Next, since 21 | 105, then 21 = lub(3, 7). Similarly, the luh(3, 5) = 15. The greatest element of L is 105 since a | 105 for 
all a e L. To find the gib of 15 and 35, we first consider all elements g of L such that g I 15 and g I 35. Certainly, both g = 5 and g = I 
satisfy these conditions. But since 1 | 5, then gib (15, 35) = 5. The least element of L is 1 since 1 | a for all a e L. 

Henceforth, for any positive integer n, D„ will denote the set of all positive integers which are divisors of n. For example, the set L of 
Example 13.1.1 isDjos. 

Example 13.1.2. Consider the poset PiA), where A = [a, b, c), with the relation c on ?'(A). The gib of the {a, b] and |a, c) is 
g = [a]. For any other element g' of M which is a subset of |a, b] and |a, c) (there is only one; what is it?), g' c g. The least element of 
T{A) is and the greatest element of ?'(A) is A = |a, b, c). The Hasse diagram of TiA) is shown in Figure 13.1.1. 




Figure 13.1.1 
Example 13.1.2 

With a little practice, it is quite easy to find the least upper bounds and greatest lower bounds of all possible pairs in P (A) directly from the 
graph of the poset. 

The previous examples and definitions indicate that the lub and gib are defined in terms of the partial ordering of the given poset. It is not yet 
clear whether all posets have the property such every pair of elements has both a lub and a gib. Indeed, this is not the case (see Exercise 3). 

EXERCISES FOR SECTION 13.1 
A Exercises 

1. Let £)3o = |1, 2, 3, 5, 6, 10, 15, 30) and let the relation I be a partial ordering on 1)30. 

(a) Find all lower bounds of 10 and 15. 

(b) Findthegtoof lOand 15. 

(c) Find all upper bounds of 10 and 15. 

(d) Determine the lub of 10 and 15. 

(e) Draw the Hasse diagram for 1)30 with I. Compare this Hasse diagram with that of Example 13.1.2. Note that the two diagrams are 
structurally the same. 

2. List the elements of the sets Dg,D^, and fiooi ■ For each set, draw the Hasse diagram for "divides." 

3. Figure 13.1 .2 contains Hasse diagrams of posets. 

(a) Determine the lub and gib of all pairs of elements when they exist. Indicate those pairs that do not have a lub (or a gib). 

(b) Find the least and greatest elements when they exist. 
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(e) (f) (g) (h) 




Figure 13.1.2 
Exercise 3 

4. For the poset (f^, <), what are glh(a, b) and lub(a, b)l Are there least and/or greatest elements? 

5. (a) Prove the second part of Theorem 13.1.1, the least upper bound of two elements in a poset is unique, it one exists, 
(b) Prove that if a poset L has a least element, then that element is unique. 

6. We naturally order the numbers in A„, = {1, 2, . . . , m] with "less than or equal to," which is a partial ordering. We may order the 
elements of A,„ x A„ by (a, b) < (a \ b') a < a' and b < b'. 

(a) Prove that this defines a partial ordering of A„, x A„ . 

(b) Draw the ordering diagrams for < on A2 x A2 , A2 x A3 , and A3 x A3 . 

(c) What are gft ((a, b), (a\ b')) and lub((a, b), (a', b'))l 

(d) Are there least and/or greatest elements in A,„ x A„? 

I 13.2 Lattices 

In this section, we restrict our discussion to lattices, those posets where every pair of elements has a lub and a gib. We first introduce some 
notation. 

Definitions: Join, Meet. Let Lbe a poset under an ordering < . Let a, b £ L. We define: 

a y b (read "a join b") as the least upper bound of a and b, and 

a f\ b (read "a meet b") as greatest lower bound of a and b. 

Since the join and meet operations produce a unique result in all cases where they exist, by Theorem 13.1.1, we can consider them as binary 
operations on a set if they aways exist. Thus the following definition: 

Definition: Lattice. A lattice is a poset L (under < ) in which every pair of elements has a lub and a gib. Since a lattice L is an algebraic 
system with binary operations V and A , it is denoted by [L; V. A]- 

In Example 13.1.2, the operation table for the lub operation is easy, although admittedly tedious, to do. We can observe that every pair of 
elements in this poset has a least upper bound. In fact, A\J B = A \J B. 

The reader is encouraged to write out the operation table for the gib operation and to note that every pair of elements in this poset also has a 
gib, so that f (A) together with these two operations is a lattice. We further observe that: 

(1) [f (A); V , A ] is a lattice (under c ) for any set A, and 

(2) the join operation is the set operation of union and the meet operation is the operation intersection; that is, V =U''iid A = fl- 
it can be shown (see the exercises) that the commutative laws, associative laws, idempotent laws, and absorption laws are all true for any 
lattice. An example of this is clearly YP(A); IJ, {^}, since these laws hold in the algebra of sets. This lattice is also distributive in that join is 
distributive over meet and meet is distributive over join. This is not always the case for lattices in general however. 
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Definition: Distributive Lattice. Let [L; V , A] be a lattice (under <j. [L; V . A I w called a distributive lattice if and only if the distribu- 
tive laws hold: that is, for all a, b, c e L, we have: 

a W {b /\ c) = (a y b) A (a V c) and 
a A {b y c) = (a A b) y (a A c). 
Example 13.2.1. If A is any set, the lattice [f (A); |J> HI is distributive. 

Example 13.2.2. We now give an example of a lattice where the distributive laws do not hold. Let L = |1, 2, 3, 5, 30}. Then L is a poset 
under the relation divides. The operation tables for V and A on L are: 



V 


1 


2 


3 


5 30 


1 


1 


2 


3 


5 30 


2 


2 


2 


30 


30 30 


3 


3 


30 


3 


30 30 


5 


5 


30 


30 


5 30 


30 


30 


30 


30 


30 30 


A 


1 


2 


3 


5 30 


1 


1 


1 


1 


1 1 


2 


1 


2 


1 


1 2 


3 


1 


1 


3 


1 3 


5 


1 


1 


1 


5 5 


30 


1 


2 


3 


5 30 



Since every pair of elements in L has both a join and a meet, [L; V , AJ is a lattice (under divides). Is this lattice distributive? We note that: 

2 V (5 A 3) = 2 V 1 = 2 and 

(2 V 5) A (2 V 3) = 30 A 30 = 30, 
so that a y (b A c) t {a y b) A (o y c) for some values of a, b, c g L. Hence L is not a distributive lattice. 
It can be shown that a lattice is nondistributive if and only if it contains a sublattice isomorphic to one of the lattices in Figure 13.2.1 . 



1 




Figure 13.2.1 
Nondistributive lattices 

It is interesting to note that for the relation "divides" on P, if a, e IP we have: 

a y b = Icm {a, b), the least common multiple of a and b; that is, the smallest integer (in P) that is divisible by both a and b; 
a A b = gcd{a, b), the greatest common divisor of a and b; that is, the largest integer that divides both a and b. 

EXERCISES FOR SECTION 13.2 
A Exercises 

1 . Let L be the set of all propositions generated by p and q. What are the meet and join operations in this lattice. What are the maximum and 
minimum elements? 

2. Which of the posets in Exercise 3 of Section 13.1 are lattices? Which of the lattices are distributive? 
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B Exercises 

3 . (a) State the commutative laws, associative laws, idempotent laws, and absorption laws for lattices. 

(b) Prove these laws. 

4. Let [L; V , A J be a lattice based on a partial ordering < . Prove that if a, c eL, 

(a) a y h > a. 

(b) a A b < a. 

(c) a > b and a>c=>a>b\/c. 

1 13.3 Boolean Algebras 

In order to define a Boolean algebra, we need the additional concept of complementation. 

Definition: Complemented Lattice. Let [L; V> A] be a lattice that contains a least element, 0, and a greatest element, 1 . [L; V. A] w called a 
complemented lattice if and only if for every element a eL, there exists an element a in L such that a f\a = and aV a = 1. Such an element a 
is called a complement of the element a. 

Example 13.3.1. Let L = PiA), where A = {a, b, c). Then [L; |J. fl is a bounded lattice with = and 1 = A. Then, to find if it 
exists, the complement, B, of, say B = [a, b] e L, we want B such that 

(a, fo) n = and {a,b][JB = A. 

Here, B = [c], and since it can be shown that each element of L has a complement (see Exercise 1), [L; IJ> HI is a complemented lattice. 
Note that if A is any set and L = P(_A), then [L; U > fl is a complemented lattice where the complement ofBeI,isB = B'^ = A-B. 

In Example 13.3.1, we observe that the complement of each element of L is unique. Is this always the case? The answer is no. Consider the 
following. 

Example 13.3.2. Let L = |1, 2, 3, 5, 30) and consider the lattice [L; V> A] (under "divides"). The least element of Z. is 1 and the 
greatest element is 30. Let us compute the complement of the element a = 2. We want to determine a such that 2 /\ a = 1 and 2 V a = 30. 
Certainly, a = 3 works, but so does a = 5, so the complement of a = 2 in this lattice is not unique. However, [L; V > A] is stiU a comple- 
mented lattice since each element does have at least one complement. 

The following theorem gives us an insight into when uniqueness of complements occurs. 
Theorem 133.1. If[L; V> /Wis a complemented and distributive lattice, then the complement a of any element a eLisunique. 
Proof: Let a e L and assume to the contrary that a has two complements, namely ai and 02. Then by definition of complement, 
a A ai = Ganda V ai = 1, 

Also, 

a A 02 = and a V 02 = 1. 

So that 

oi = ai A 1 = oi A (a V 02) 
= (ai A a) V (ai A 02) 
= V (ai A 02) 
= fli A 02. 

On the other hand, 

^2 = 02 A 1 = "2 A (a V Oi) 
= (02 A a) V (02 A ai) 
= V (02 A ai) 
= a2 A O] . 

Hence ai = 02 , which contradicts the assumption that a has two different complements, ai and 02- ■ 

Definition: Boolean Algebra. A Boolean algebra is a lattice that contains a least element and a greatest element and that is both 
complemented and distributive . 

Since the complement of each element in a Boolean algebra is imique (by Theorem 13.3.1), complementation is a valid unary operation over 
the set under discussion, and we will list it together with the other two operations to emphasize that we are discussing a set together with three 
operations. Also, to help emphasize the distinction between lattices and lattices that are Boolean algebras, we wiU use the letter B as the generic 
symbol for the set of a Boolean algebra; that is, [B; -, V , A] will stand for a general Boolean algebra. 
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Example 13.3.3. Let A be any set, and let B = P(A). Then [B; c, U , HI is a Boolean algebra. Here, c stands for the complement of an 
element of B with respect to A, A - B. 

This is a key example for us since all finite Boolean algebras and many infinite Boolean algebras look like this example for some A. In fact, a 
glance at the basic Boolean algebra laws in Table 13.3.1, in comparison with the set laws of Chapter 4 and the basic laws of logic of Chapter 3, 
indicates that all three systems behave the same; that is, they are isomorphic. 

The "pairing" of the above laws reminds us of the principle of duaUty, which we state for a Boolean algebra. 

Definition: Principle of Duality for Boolean Algebras. Let [B; -, V , A] he a Boolean algebra (under <), and let S be a true statement for 
[B; -, V . A]- IfS* is obtained from S by replacing < by > (this is equivalent to turning the graph upside down), M by A, A by V, by 1, and 
1 by 0, then S* is also a true statement. 

TABLE 13.3.1 
Basic Boolean Algebra Laws 



Commutative Laws 
\. aM b = b\j a \: a f\b = b l\ a 



Associative Laws 

2. ay {by c) = {ay b)y c i: a t\{b t\c) = {a t\b) t\c 



Distributive Laws 

3. a A {b y c) = {a A b) y {a A c) 3.' a y {b A c) = {a y b) A {a V c) 



Identity Laws 

4. aVO = OVa = a 4.'aAl=lAa = a 



Complement Laws 
5. a V a = 1 S: a Aa = 



Idempotent Laws 
6. a y a = a 6.' a A a = a 



NuU Laws 

7. a V 1 = 1 7.' a A = 



Absorption Laws 

8. ay (a A b) ^ a 8.' a A ((' y h) = a 



DeMorgan's Laws 
9. ay b = aAb 9.' a Ab = aMb 

Involution Law 
10. a = a 

Example 133.4. The laws 1' through 9' are the duals of the Laws 1 through 9 respectively. Law 10 is its own dual. 
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We close this section with some comments on notation. The notation for operations in a Boolean algebra is derived from the algebra of logic. 
However, other notations are used. These are summarized in the following chart; 



Notation used in this text 
(Mathematics notation) 


Set Notation 


Logic Design 

(CS/EE notation) 


Read as 


V 


u 


e 


join 


A 


n 


® 


meet 




c 




complement 


< 


c 


< 


underlying partial ordering 



Mathematicians most frequently use the notation of the text, and, on occasion, use set notation for Boolean algebras. Thinking in terms of sets 
may be easier for some people. Computer designers traditionally use the arithmetic and notation. In this latter notation, DeMorgan's Laws 
become: 

(9) a ® b = a ®b 

and 

(9') a®T = a®b. 

EXERCISES FOR SECTION 13.3 
A Exercises 

1. Determine the complement of each element B e L in Example 13.3.1. Is this lattice a Boolean algebra? Why? 

2. (a) Determine the complement of each element of Dg in [Dg; V> A]- 

(b) Repeat part a using the lattice in Example 13.2.2. 

(c) Repeat part a using the lattice in Exercise 1 of Section 13.1. 

(d) Are the lattices in parts a, b, and c Boolean algebras? Why? 

3. Determine which of the lattices of Exercise 3 of Section 13.1 are Boolean algebras. 

4. LetA = (a, fojandB = n^). 

(a) Prove that [6; c, |J, PJ is a Boolean algebra. 

(b) Write out the operation tables for the Boolean algebra. 

5. It can be shown that the following statement, S, holds for any Boolean algebra [B; -, V , /\\ : (a /\ b) = aH a ■< b. 

(a) Write the dual, S', of the statement S. 

(b) Write the statement S and its dual, S', in the language of sets. 

(c) Are the statements in part b true for all sets? 

(d) Write the statement S and its dual, S*, in the language of logic. 

(e) Are the statements in part d true for all propositions? 

6. State the dual of: 

(a) a \J {b t\ a) = a. 

(b) a V ((F V a) A b) = 1. 

(c) (a A b) f\ b = a y b. 
B Exercises 

7 . Formulate a definition for isomorphic Boolean algebras . 

\ 13.4 Atoms of a Boolean Algebra 

In this section we will look more closely at previous claims that every finite Boolean algebra is isomorphic to an algebra of sets. We will show 
that every finite Boolean algebra has 2" elements for some n with precisely n generators, called atoms. 

Consider the Boolean algebra [B; -, V> A]> whose graph is: 
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Figure 13.4.1 
Illustration of the atom concept 

We note that 1 = ai V 02 V 03, = a] V 02. ^2 = oi V 03, and = 02 y a^; that is, each of the elements above level one can be 
described completely and uniquely in terms of the elements on level one. The a, s have uniquely generated the nonzero elements of B much like 
a basis in linear algebra generates the elements in a vector space. We also note that the a; s are the immediate successors of the minimum 
element, 0. In any Boolean algebra, the immediate successors of the minimum element are called atoms. Let A be any nonempty set. In the 
Boolean algebra ['P(Ay, c, |J, P|] (over c), the singleton sets are the generators, or atoms, of the algebraic structure since each element PiA) 
can be described completely and uniquely as the join or union of singleton sets. 

Definition: Atom. A nonzero element a in a Boolean algebra [B; -, V , AJ is called an atom if for every xeB,xl\a=aor 
X /\ a = 0. 

The condition that x A a = a tells us that x is a successor of a; that is, a < x, as depicted in Figure 13.4.2a. 

The condition x A a = is true only when x and a are "not connected." This occurs when x is another atom or if x is a successor of atoms 
different from a, as depicted in Figure 13.4.2b. 



X 




X 



a 

(a) (b) 
Figure 13.4.2 

Example 13.4.1. The set of atoms of the Boolean algebra [Dt,q; -, V , /\] is M = |2, 3, 5|. To see that a = 2 is an atom, let x be any 
nonzero element of D30 and note that one of the two conditions xA2 = 2orxA2 = 1 holds. Of course, to apply the definition to this 
Boolean algebra, we must remind ourselves that in this case the 0-element is 1, the operation A is gcd, and the poset relation < is "divides." So 
if X = 10, we have 10 A 2 = 2 (or 2 I 10), so Condition 1 holds. If x = 15, the first condition is not true. (Why?) However, Condition 2, 
15 A 2 = 1, is true. The reader is encouraged to show that each of the elements 2, 3, and 5 satisfy the definition (see Exercise 13.4.1). Next, if 
we compute the join (Icm in this case) of all possible combinations of the atoms 2, 3, and 5, we will generate all nonzero elements of 1)30. For 
example, 2 V 3 V 5 = 30 and 2 V 5 = 10. We state this concept formally in the following theorem, which we give without proof. 

Theorem 13.4.1. Let[B; -, V, t\\ be any finite Boolean algebra. Let A = [aj, a2 a „] be the set of all n atoms of [B; -, V, A]. 

Then every nonzero element in B can be expressed uniquely as the join of a subset of A. 

We now ask ourselves if we can be more definitive about the structure of different Boolean algebras of a given order. Certainly, the Boolean 
algebras [1)30; -, V . A] and YP(A)\ c, IJ- fl have the same graph (that of Figure 13.4.1), the same number of atoms, and, in all respects, 
look the same except for the names of the elements and the operations. In fact, when we apply corresponding operations to corresponding 
elements, we obtain corresponding results. We know from Chapter 11 that this means that the two structures are isomorphic as Boolean 
algebras. Furthermore, the graphs of these examples are exactly the same as that of Figure 13.4.1, which is an arbitrary Boolean algebra of 
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order 8 = 2^ . 

In these examples of a Boolean algebra of order 8, we note that each had 3 atoms and 2^ = 8 number of elements, and all were isomorphic to 
[P(_A); c, U> n.whereA = {a, b, c}. This leads us to the following questions: 

(1) Are there any other different (nonisomorphic) Boolean algebras of order 8? 

(2) What is the relationship, if any, between finite Boolean algebras and their atoms? 

(3) How many different (nonisomorphic) Boolean algebras are there of order 2? Order 3? Order 4? And so on. 

The answers to these questions are given in the following theorem and corollaries. We include the proofs of the corollaries since they are 
instructive. 

Theorem 13.4.2. Let [B; -, V , A] be any finite Boolean algebra, and let A be the set of all atoms in this Boolean algebra. Then 
[B; - V, h] is isomorphic to [P{A); c, U. fH- 

Corollary 134.1. Every finite Boolean algebra [B ; -, V, l\\ has 2"^ elements for some positive integer n. 

Proof: Let A be the set of all atoms of B and let \A\ = n. Then there are exactly 2" elements (subsets) in P(A), and by Theorem 13.4.2, 
[B; -, V, A] is isomorphic to [•P(A); c, U, HI- ■ 

Corollary 13.4.2. All Boolean algebras of order 2" are isomorphic to each other. (The graph of the Boolean algebra of order 2" is the n-cube). 

Proof: By Theorem 13.4.2, every Boolean algebra of order 2" is isomorphic to [PiA); c, \J, fH when \A\ = n. Hence, they are all isomor- 
phic to one another. ■ 

The above theorem and corollaries tell us that we can only have finite Boolean algebras of orders 2^ 2^, 2^, . . . , 2", and that all finite 
Boolean algebras of any given order are isomorphic. These are powerful tools in determining the structure of finite Boolean algebras. In the 
next section, we will try to find the easiest way of describing a Boolean algebra of any given order. 

EXERCISES FOR SECTION 13A 

A Exercises 

1. (a) Show that a = 2 is an atom of the Boolean algebra [D30; -, V. A]- 

(b) Repeat part a for the elements 3 and 5 of D30. 

(c) Verify Theorem 13.4.1 for the Boolean algebra [D30; -, V , A]- 

2. Let A = {a, b, c]. 

(a) Rewrite the definition of atom for [?'(A); c, U> fl- What does a < x mean in this example? 

(b) Find all atoms of [^(A); c, U, fH- 

(c) Verify Theorem 13.4.1 for [^(A); c, U, H- 

3. Verify Theorem 13.4.2 and its corollaries for the Boolean algebras in Exercises 1 and 2 of this section. 

4. Give a description of all Boolean algebras of order 16. (Hint: Use Theorem 13.4.2.) Note that the graph of this Boolean algebra is given in 
Figure 9.4.5. 

5. Corollary 13.4.1 states that there do not exist Boolean algebras of orders 3, 5, 6, 7, 9, etc. (orders different from 2"). Prove that we cannot 
have a Boolean algebra of order 3. (Hint: Assume that [B; -, V . A] is a Boolean algebra of order 3 where B = {0, x, 1) and show that this 
carmot happen by investigating the possibihties for its operation tables.) 

6. (a) There are many different, yet isomorphic. Boolean algebras with two elements. Describe one such Boolean algebra that is derived from 
a power set, f (A), under c. Describe a second that is described from D„, for some n e P, under "divides." 

(b) Since the elements of a two-element Boolean algebra must be the greatest and least elements, 1 and 0, the tables for the operations on 
{0, 1} are determined by the Boolean algebra laws. Write out the operation tables for [{0, 1); -, V> A]- 

B Exercises 

7. Find a Boolean algebra with a countably infinite number of elements. 

8. Prove that the direct product of two Boolean algebras is a Boolean algebra. (Hint: "Copy" the corresponding proof for groups in Section 
11.6.) 
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1 13.5 Finite Boolean Algebras as n-tuples of O's and 1's 

From the previous section we know that all finite Boolean algebras are of order 2", where n is the number of atoms in the algebra. We can 
therefore completely describe every finite Boolean algebra by the algebra of power sets. Is there a more convenient, or at least an alternate way, 
of defining finite Boolean algebras? In Chapter 1 1 we found that we could produce new groups by taking Cartesian products of previously 
known groups. We imitate this process for Boolean algebras. 

The simplest nontrivial Boolean algebra is the Boolean algebra on the set B2 = (0, 1). The ordering on B2 is the natural one, 
OsO, 0^1, 1^1. If we treat and 1 as the truth values "false" and "true," respectively, we see that the Boolean operations 
V (join) and A (meet) are nothing more than the logical connectives V (or) and A (and). The Boolean operation, -, (complementation) is the 
logical (negation). In fact, this is why the symbols -, V , and A were chosen as the names of the Boolean operations. The operation tables 
for YB2\ -, V . A] are simply those of "or," "and," and "not," which we repeat here: 



V 


1 


A 


1 


u 


u 





1 








0" 


T 


1 


1 1 


1 


1 


1 






By Theorem 13.4.2 and its corollaries, all Boolean algebras of order 2 are isomorphic to this one. 

We know that if we form 62x62 = B2 we obtain the set {(0, 0), (0, 1), (1, 0), (1, 1)), a set of order 4. We define operations on B2 the 
natural way, namely, componentwise, so that (0, 1)V(1, l) = (OVl, 1V1) = (1, 1), (0, 1)A(1, l) = (OAl, 1A1) = (0, l)and(07T) = 

(0, T) = (1, 0). We claim that B\ is a Boolean algebra under the componentwise operations. Hence, [B\; -, V , A] is a Boolean algebra of 

order 4. Since all Boolean algebras of order 4 are isomorphic to each other, we have found a simple way of describing all Boolean algebras of 
order 4. 

It is quite clear that we can describe any Boolean algebra of order 8 by considering B2XB2XB2 =62 and, in general, any Boolean algebra of 
order 2"— that is, all finite Boolean algebras— by =62x62 x •••B2 (n factors). 

EXERCISES FOR SECTION 13.5 
A Exercises 

1. (a) Write out the operation tables for [B2', -, V > A]- 

(b) Draw the Hasse diagram for [Bj; V, A] and compare your results with Figure 9.4.6. 

(c) Find the atoms of this Boolean algebra. 

2. (a) Write out the operation table for [B|; -, V, A]- 

(b) Draw the Hasse diagram for [B|; -, V> A] atid compare the results with Figure 9.4.6. 

3. (a) List all atoms of Bj. 

(b) Describe the atoms of B^ n ? 1 . 

B Exercise 

4. Theorem 13.4.2 tells us we can think of any finite Boolean algebra in terms of sets. In Chapter 4, Section 3, we defined the terms minset and 
minset normal form. Rephrase these definitions in the language of Boolean algebra. The generalization of minsets are called minterms. 

1 13.6 Boolean Expressions 

In this section, we wiU use our background from the previous sections and set theory to develop a procedure for simplifying Boolean expres- 
sions. This procedure has considerable application to the simplification of circuits in switching theory or logical design. 

Definition: Boolean Expression. Let [B; -, V , A] be any Boolean algebra. Let Xj, X2, x^- be variables in B; that is, variables 
that can assume values from B. A Boolean expression generated by Xj, X2, X/^ is any valid combination of the x,- and the elements of B 
with the operations of meet, join, and complementation. 

This definition, as expected, is the analog of the definition of a proposition generated by a set of propositions, presented in Section 3.2. 

Each Boolean expression generated by k variables, e(xi, x^.), defines a function / : B'' ^ B where /(oi, a^) = e(ai, a^). If B is a 

finite Boolean algebra, then there are a finite number of functions from B* into B. Those functions that are defined in terms of Boolean 
expressions are called Boolean functions. As we will see, there is an infinite number of Boolean expressions that define each Boolean function. 
Naturally, the "shortest" of these expressions wiU be preferred. Since electronic circuits can be described as Boolean fimctions with B = B2 , 
this economization is quite useful. 
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Example 13.6.1. Consider any Boolean algebra [5; -, V . A] of order 2. How many functions / : -> B are there? First, all 
Boolean algebras of order 2 are isomorphic to [B2; -, V > A] so we want to determine the number of functions / : B2 -» B2. If we consider 
a Boolean function of two variables, xi and X2, we note that each variable has two possible values and 1, so there are 2^ ways of assigning 
these two values to the A: = 2 variables. Hence, the table below has 2^ = 4 rows. So far we have a table such as that labeled 13.6.1 . 



X, 


X2 


f(Xl, X2) 








? 





1 


? 


1 





? 


1 


1 


? 



Table 13.6.1 

General Form Of Boolean Function /(xi, X2)ofExample 13.6.1 

How many possible different function values /(xi, X2) can there be? To list a few:/i(xi, X2)=xi, fjixi, X2) = X2, fi(x\, X2) = xi Vx2, 
U(xi, X2) = {xi l\xi)\/ X2 , fi(xi, X2) = xi A X2 V X2 , etc . Each of these wiU give a table like that of Table 1 3 .6 . 1 . The tables for /i , and /a 
appear in Table 13.6.2. 



Xi 


X2 


f\{Xx,X2) 


Xi 


X2 


h{Xx,X2) 























1 
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1 


1 





1 


1 
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1 


1 


1 


1 


1 


1 



Table 13.6.2 

Boolean Functions /j and fj, of Example 13.6.1 



Two functions are different if and only if their tables (values) are different for at least one row. Of course by using the basic laws of Boolean 
algebra we can see that fi= U- Why? So if we simply list by brute force all "combinations" of xi andx2 we will obtain unnecessary duplica- 
tion. However, we note that for any combination of the variables xi, and X2 there are only two possible values for /(xi, X2), namely or 1. 
Thus, we could write 2* = 16 different functions on 2 variables. 

Now let's count the number of different Boolean functions in a more general setting. We wiU consider two cases: first, when B = B2 , and 
second, when B is any finite Boolean algebra with 2" elements. 

Let B = B2. Each function / : B* ^ B is defined in terms of a table having 2* rows. Therefore, since there are two possible images for each 
element of B*, there are 2 raised to the 2*, or 2^' different functions. We claim that every one of these functions is a Boolean function. 

Now suppose that |B| = 2" > 2. A function from B* into B can still be defined in terms of a table. There are |B|* rows to each table and |B| 

possible images for each row. Therefore, there are 2" raised to the power 2"* different functions. If n > 1, then not every one of these functions 
is a Boolean function. Notice that in counting the numbers of functions we are applying the result of Exercise 5 of Section 7.1 . 

Since all Boolean algebras are isomorphic to a Boolean algebra of sets, the analogues of statements in sets are useful in Boolean algebras. 
Definition: Minterm. A Boolean expression generated by Xj, Xj, Xi^ that has the form 

k 

A yi, 

i=l 

where each yi may be either x, or x; is called a minterm generated by Xj, Xj, .... Xf^. 

By a direct application of the Product Rule we see that there are 2* different minterms generated by xi , . . . , Xf^. 

Definition: Minterm Normal Form. A Boolean expression generated by x,, Xj- is in minterm normal form if it is the join of 
expressions of the form a Km, where a eB and mis a minterm generated by Xj, x^-. That is, it is of the form 

V {aJ^mJ), 
J=' 

where p = 2'' and mj, nip are the minterms generated by Xj, X/, 

If B = B2, then each oy in a minterm normal form is either or 1 . Therefore, ay A nij is either or my. 

Ttieorem 13.6. 1. Let eixj, . . ., xi^) be a Boolean expression over B. There exists a unique minterm normal form M{xj, . . ., Xj^ that is 
equivalent to e(x/, . . ., Xj.) in the sense that e and M define the same function from B* into B. 

The uniqueness in this theorem does not include the possible ordering of the minterms in M (commonly referred to as "uniqueness up to the 
order of minterms"). The proof of this theorem would be quite lengthy, and not very instructive, so we will leave it to the interested reader to 
attempt. The implications of the theorem are very interesting, however. 

If |B| = 2", then there are 2" raised to the 2* different minterm normal forms. Since each different minterm normal form defines a different 
function, there are a like number of Boolean functions from B* into B. If B = B2, there are as many Boolean functions (2 raised to the 2*^) as 
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there are functions from B* into B, since there are 2 raised to the 2" functions from into B. The significance of this resuh is that any desired 
function can be obtained using electronic circuits having or 1 (off or on, positive or negative) values, but more complex, multivalued circuits 
would not have this flexibility. 

We will close this section by examining minterm normal forms for expressions over B2 , since they are a starting point for circuit 
economization. 

Example 13.6.2. Consider the Boolean expression /(xi , X2) = xi \l xj. One method of determining the minterm normal form of/ is to think 
in terms of sets. Consider the diagram with the usual translation of notation in Figure 13.6.1 . Then /(xi , xt) = hx2)\J {x\ AxJjVUi AX2). 




Figure 13.6.1 



Example 13.6.3. Consider the function f : B2 defined by Table 13.6.3. The minterm normal form for/can be obtained by taking the 
join of minterms that correspond to rows that have an image value of 1 . If f(a\ , ai, 03) = 1 , then include the minterm y\ A ^2 A J3 where 

f if a.; = 1 

\ Xj II Qj = 

TABLE 13.6.3 
Boolean Function of f(ai, 32, 33) Of Example 13.6.3 



ai 


3.2 


as 


f (ai, 3.2, as) 











1 








1 








1 
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1 


1 


1 











1 





1 





1 


1 





1 


1 





1 






Therefore, 

f(Xi, X2, X3) = (xf A X2 A X3) V (M 1^ X2 A X3) V (^1 A X2 A xJ). 

The minterm normal form is a first step in obtaining an economical way of expressing a given Boolean function. For functions of more than 
three variables, the above set theory approach tends to be awkward. Other procedures are used to write the normal form. The most convenient is 
the Karnaugh map, a discussion of which can be found in any logical design/switching theory text (see, for example. Hill and Peterson). 

EXERCISES FOR SECTION 13.6 

A Exercises 

1 . (a) Write the 16 possible functions of Example 13.6.1 . (Hint: Find all possible joins of minterms generated by xi and X2 .) 

(b) Write out the tables of several of the above Boolean functions to show that they are indeed different. 

(c) Determine the minterm normal form of 

/l(Xi, X2) =Xi VX2, 

/2(xi, X2) = xrvx^ 

/3(Xi, X2) =0, /4(Xi, X2) = 1. 

2. Consider the Boolean expression /(xj , xt, X3) = (X3 A X2)V(xrA X3)V(x2 A X3) on [B2; -, V, A]- 
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(a) Simplify this expression using basic Boolean algebra laws. 

(b) Write this expression in minterm normal form. 

(c) Write out the table for the given function defined by /and compare it to the tables of the functions in parts a and b. 

(d) How many possible different functions in three variables on [B2; V. A] are there? 
B Exercise 

3. Let [6; -, V. AJ be a Boolean algebra of order 4, and let/be a Boolean function of two variables on 5. 

(a) How many elements are there in the domain of/? 

(b) How many different Boolean functions are there of two, variables? Three variables? 

(c) Determine the minterm normal form of /(xj , X2) = Xi V X2. 

(d) If B = {0, a, b, 1), define a function from into B that is not a Boolean function. 

I 13.7 A Brief Introduction to the Application of Boolean Algebra to Switching Theory 

The algebra of switching theory is Boolean algebra. The standard notation used for Boolean algebra operations in most logic design/switching 
theory texts is + for V and • for A. Complementation is as in this text. Therefore, (xi A xT) V (^1 A X2) V A X2) becomes 
xi •X2 + xi • X2 + xi"«X2, or simply xi xj + X] X2 + xi"x2 . All concepts developed previously for Boolean algebras hold. The only change is 
purely notational. We make the change in this section solely to introduce the reader to another frequently used notation. Obviously, we could 
have continued the discussion with our previous notation. 

The simplest switching device is the on-off switch. If the switch is closed, on, current will pass through it; if it is open, off, current will not pass 
through it. If we designate on by true or the logical, or Boolean, 1 , and off by false, the logical, or Boolean, 0, we can describe electrical circuits 
containing switches by logical, or Boolean, expressions. The expression xi •X2 represents the situation in which a series of two switches 
appears in a circuit (see Figure 13.7. la). In order for current to flow through the circuit, both switches must be on, that is, have the value 1 . 




(a) m 

HGURE 13.7,1 

Similarly, a pair of parallel switches, as in Figure 13.7.1b, is described algebraically by xi + X2. Many of the concepts in Boolean algebra can 
be thought of in terms of switching theory. For example, the distributive law in Boolean algebra (in +, • notation) is:xi • (x2 + X3) = 
xi •X2 +xi 'Xi. Of course, this says the expression on the left is always equivalent to that on the right. The switching circuit analogue of the 
above statement is that Figure 13.7.2a is equivalent (as an electrical circuit) to Figure 13.7.2b. 

The circuits in a digital computer are composed of large quantities of switches that can be represented as in Figure 13.7.2 or can be thought of 
as boxes or gates with two or more inputs (except for the NOT gate) and one output. These are often drawn as in Figure 13.7.3. For example, 
the OR gate, as the name implies, is the logical/Boolean OR function. The on-off switch function in Figure 13.7.3a in gate notation is Figure 
13.7.3b. 
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FIGURE 13.7 J 



Either diagram indicates that the circuit will conduct current if and only if /(xi, X2, X3) is true, or 1. We list the gate symbols that are widely 
used in switching theory in Figure 13.7.4 with their names. The names mean, and are read, exactly as they appear. For example, NAND means 
"not xi and X2" or algebraically , A X2, or xi • X2. 

The circuit in Figure 13.7.5a can be described by gates. To do so, simply keep in mind that the Boolean function f(x], X2) = xi •X2 of this 
circuit contains two operations. The operation of complementation takes precedence over that of "and," so we have Figure 13.7.5b. 

ExamplG 1 3.7.1 . The switching circuit in Figure 13.7.6a can be expressed through the logic, or gate, circuit in Figure 13.7.6b. 
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FIQURE 13.7.4 



We leave it to the reader to analyze both figures and to convince him- or herself that they do describe the same circuit. The circuit can be 
described algebraically as 

/(Xi, X2, X3) = ((Xi + X2) + (Xi + X3))«Xi«X^. 

We can use basic Boolean algebra laws to simplify or minimize this Boolean function (circuit): 

/(Xi, X2, X3) = ((Xi + X2) + (Xi + X3))«Xi«X^ 

= (Xi + X2 + X3) • Xi • X2 

= (Xi • Xi • X2 + X2 • Xi • X2 + X3 • Xi • X2 

= Xl • X2 + • Xi + X3 • Xi • X? 
= X] • X2 + X3 'Xl • X2 
= Xl • (X2 + X2 •X3) 
= Xl -X^ • (1 + X3) 

The circuit for/may be described as in Figure 13.7.5. This is a less expensive circuit since it involves considerably less hardware. 
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The table for/is: 
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The Venn diagram that represents/is the shaded portion in Figure 13.7.7. From this diagram, we can read off the minterm normal form of/: 

/(Xl, X2, X3) = Xl 'X^ • X3" + Xl •X2* X3. 



Applied Discrete Structures by A. Doerr & K. Levasseur is licensed under a Creative Commons Attribution-Noncommercial-ShareAlike 3.0 United States License. 



338 



Chapter 13 - Boolean Algebra 




Figure 13.7.7 



The circuit (gate) diagram appears in Figure 13.7.8. 

How do we interpret this? We see that f{xi, xj, x^) = 1 when xi = 1, X2=0, and X3 = or X3 = 1. Current will be conducted through the 
circuit when switch xi is on, switch xi is off, and when switch X3 is either off or on. 



^3' 



o 



JC| . Xi ' Xi 

FFSURI W-7.6 



We close this section with a brief discussion of minimization, or reduction, techniques. We have discussed two in this text: algebraic (using 
basic Boolean rules) reduction and the minterm normal form technique. Other techniques are discussed in switching theory texts. When one 
reduces a given Boolean function, or circuit, it is possible to obtain a circuit that does not look simpler, but may be more cost effective, and is, 
therefore, simpler with respect to time. We illustrate with an example. 

ExamplG 13.7.2. Consider the Boolean function of Figure 13.7.9a is f(xi, X2, xj, X4) = ((xj •X2)*X3)«X4, which can also be diagrammed 
as in Figure 13.7.9b. 



Applied Discrete Structures by A. Doerr & K. Levasseur is licensed under a Creative Commons Attribution-Noncommercial-ShareAlike 3.0 United States License. 



Chapter 13 - Boolean Algebra 



339 



A> - 




X2 ■ 
Xi- 




m 

Is Circuit b simpler than Circuit a? Both circuits contain the same number of gates, so the hardware costs (per gate) would be the same. Hence, 
intuitively, we would guess that they are equivalent with respect to simplicity. However, the signals X3 and X4 in Circuit a pass through three 
levels of gating before reaching the output. All signals in Circuit b go through only two levels of gating (disregard the NOT gate when counting 
levels). Each level of logic (gates) adds to the time delay of the development of a signal at the output. In computers, we want the time delay to 
be as small as possible. Frequently, speed can be increased by decreasing the number of levels in a circuit. However, this frequently forces a 
larger number of gates to be used, thus increasing costs. One of the more difficult jobs of a design engineer is to balance off speed with 
hardware costs (number of gates). 

One final remark on notation: The circuit in Figure 13.7.10a can be written as in Figure 13.7.10b, or simply as in Figure 13.7.10c. 
EXERCISES FOR SECTION 13.7 

A Exercises 

1 . (a) Write all inputs and outputs from Figure 13.7.1 1 and show that its Boolean function is f{x] , X2, X3) = ((xi + X2) • x^) • (xi + X2). 

(b) Simplify / algebraically. 

(c) Find the minterm normal form of/. 

(d) Draw and compare the circuit (gate) diagram of parts b and c above. 

(e) Draw the on-off switching diagram of/in part a. 




Vi • Si 



(C) 

FIGURE 137.10 
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FIOUREIS.7.11 

(f) Write the table of the Boolean function/in part a and interpret the results. 
2. Given Figure 13.7.12: 




flGURE 13 7.13 

(a) Write the Boolean function that represents the given on-off circuit. 

(b) Show that the Boolean function obtained in answer to part a can be reduced to /(xi , X2) = X] . Draw the on-off circuit diagram of this 
simplified representation. 

(c) Draw the circuit (gate) diagram of the given on-off circuit diagram. 

(d) Determine the minterm normal of the Boolean function found in the answer to part a or given in part b; they are equivalent. 

(e) Discuss the relative simplicity and advantages of the circuit gate diagrams found in answer to parts c and d. 

3. (a) Write the circuit (gate) diagram of 

/(Xi, X2, X3) = (Xi«X2 + X3) • (X2 -I- X3) -I- X3. 

(b) Simplify the function in part a by using basic Boolean algebra laws. 

(c) Write the circuit (gate) diagram of the result obtained in part b. 

(d) Draw the on-off switch diagrams of parts a and b. 

4. Consider the Boolean function 

/(Xi, X2, X3, X4) = Xi -I- (X2 • (xf -I- X4) -I- X3 • (X2 -I- X4)). 

(a) Simplify /algebraically. 

(b) Draw the switching (on-off) circuit of/and the reduction of/obtained in part a. 

(c) Draw the circuit (gate) diagram of/and the reduction of/obtained in answer to part a. 
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SUPPLEMENTARY EXERCISES FOR CHAPTER 13 
Section 13.1 

1 . (a) Draw the Hasse diagram of the relation divides on the set A = (1,2, 3, . . . , 12). 
(b) For the same set A draw the Hasse diagram for the relation < on A. 

2. (a) For the poset A = {1, 2, 3, 12) under the relation divides find the luh and gib of the following pairs of numbers if possible: 4 and 6, 
2 and 3, 10 and 4, 6 and 9. 

(b) Repeat part a for the set A, but use the relation <. 

Section 13.2 

3. Consider the poset IP under the relation "divides." 

(a) Compute: 4 V 8,3 V 15, 3 V 5, 4 A 8, 3 A 15,3 A 5 for [P, V, A]- 

(b) Is [P, V , A] a distributive lattice? Explain. 

(c) Does [P, V, A] have a least element? Does it have a greatest element? If so, what are they? 

4. Let [L, V . A] be a lattice and a, b e L. Prove: 

(a) a \J b = b\i and only lia < b. 

(b) a l\ b = a'li and only if a < 

5. Let L = |0, 1) and define <onLbyO<0<l<l. 

(a) Draw the Hasse diagram of this poset. 

(b) Write out the operation table for V and A on L observing that they are essentially the standard logical connectives. 

(c) Define the operations on iJ componentwise and draw the Hasse diagram for L} . 

(d) Repeat part (c) for L? . 

6. (a) Let [Li , \J , A] and [L2, V , A J be lattices. Prove that [Li XL2, V . A] is a lattice when the operations are defined componentwise as 
we did for algebraic systems in Section 1 1 .6. 

(b) Let L\ and L2 be lattices whose posets have the following Hasse diagrams respectively. List the elements in the lattice Li XL2. 



(0, a) V (0, b) 
(0, a) A (0, b) 
(1, a) V (1, b) 
(l,a)^(l,b) 
(0, 1)V(1, 0) 












(c) Compute: 



and(0, 1) A (1,0). 



Use this information as an aid to draw the Hasse diagram for L1XL2. 



7. (a) Is A = |1, 2, 3, 12) a lattice under the relation "divides"? Explain. 



(b) Is the set A above a lattice under the relation "less than or equal to"? Explain. 
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Section 13.3 

8. Using the rules of Boolean algebra, reduce the expression (xi V X2) V A X2) V (^i A X2) to the equivalent expression V X2 . Justify each 
step. 

9. Using the rules of Boolean algebra, reduce the expression {x + y)-{x + y) to a. simpler expression. 

10. Even a cursory examination of the basic laws for Boolean algebra (Table 13.3.1), for logic (Table 3.4.1), and for sets (Section 4.2) will 
indicate that they are the same in three different languages: they are isomorphic to one another as Boolean algebras. 

(a) Fill out the following table to illustrate the above concept: 





comparable 
connectives 


Sets 


u 








Logic 






A 




Boolean Algebraa 




< 







(b) Since the above algebras are isomorphic as Boolean algebras, any theorem true in one is true in the other two. Translate each of the 
following statements into the language of the other two. 

(i) p ^ q if and only if -iq^ ^ p. 

(ii) If A c B and A c C then A c 5 fl C 

(iii) If a > ^ and a>c then a > h \/ c. 

1 1 . (a) Determine the complements of each element described by the following Hasse diagram: 



I 




(b) Is the above lattice a Boolean algebra? 

12. (a) Determine the complement of each element in the lattice D^q. 

(b) Is ZJjQ a Boolean algebra? Explain. 

Section 13.4 

13. (a) Use the Theorem 13.4.2 and its Corollaries to determine which of the following are Boolean algebras: 

(a) D20 (b) £"27 (c) (d) D210 

(b) Notice that D„ is a Boolean algebra if and only if « is a product of distinct primes. Such an integer is called square free. What are the atoms 
of D„ if n is square free? 

14. Let [B, -, V . A] be any Boolean algebra of order 8. Find a Boolean algebra of sets that is isomorphic to B. How many atoms must B 
have? 

Section 13.5 

15. (a) List all sub-Boolean algebras of order 4 in B2^ 

(b) How many sub-Boolean algebras of order 4 are there in B2" ,n>Al 

(c) Discuss how the selection of atoms in a sub-Boolean algebra can be used to answer questions such as the one in part (b). 

16. Prove that Boolean algebras B2"xB2"and 62"" " are isomorphic. 

Section 13.6 

17 Find the minterm normal form of the Boolean expression (xi"Vx2)A^3 
18. Find the minterm normal form of the Boolean expression 
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X4 A (X3 V X2 V xi) V X3 A (x2 V xj) V X2 A xi 

19. Let B be a Boolean algebra of order 2. 

(a) How many rows are there in the table of a Boolean function of 3 variables? Of n variables? 

(b) How many different Boolean functions of 3 variables and of n variables are there? 

20. Let B be a Boolean algebra of order 2. 

(a) How many different minterm normal forms are there for Boolean expressions of 2 variables over B? List them. 

(b) How many different minterm normal forms are there for Boolean expressions of 3 variables over B? 

Section 13.7 

21 . Consider the following Boolean expression: 

f(Xi, X2, X3) = ((Xi + X2 +X3)-xr+Xi +jQ-Xi -X^ 

(a) Draw the switching circuit of/. 

(b) Draw the gate diagram of/. 

(c) Simplify /algebraically and draw the switching circuit and gate diagrams of this simplified version of/. 

22. Assume that each of the three members of a committee votes yes or no on a proposal by pressing a button that closes a switch for yes and 
does nothing for no. Devise as simple a switching-circuit as you can that will allow current to pass when and only when at least two of the 
members vote in the affirmative. 

23. (a) Find the Boolean function of this network: 
(b) Draw an equivalent 

24. Given the switching circuit 




y y 

(a) Express the switching circuit algebraically. 

(b) Draw the gate diagram of the expression obtained in part a. 

(c) Simplify the expression in part a and draw the switching-circuit and gate diagram for the simplified expression. 
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Monoids and Automata 



GOALS 

At first glance, the two topics that we will discuss in this chapter seem totally unrelated. The first is monoid theory, which we touched upon in 
Chapter 11. The second is automata theory, in which computers and other machines are described in abstract terms. After short independent 
discussions of these topics, we will describe how the two are related in the sense that each monoid can be viewed as a machine and each 
machine has a monoid associated with it. 

I 14.1 Monoids 

Recall the definition of a monoid: 

Definition: Monoid. A monoid is a set M together with a binary operation * with the properties 

(a) * is associative: (a*b)*c = a*{b *c) for all a, b, c e M, and 

(b) * has an identity: there exists e £ M such that for all a e M, a*e = e *a = a . 

Note: Since the requirements for a group contain the requirements for a monoid, every group is a monoid. 
Example 14.1.1 . 

(a) The power set of any set together with any one of the operations intersection, union, or symmetric difference is a monoid. 

(b) The set of integers, Z, with multiplication, is a monoid. With addition, Z is also a monoid. 

(c) The set of nxn matrices over the integers, M„(Z), n > 2, with matrix multiplication, is a monoid. This follows from the fact that matrix 
multiplication is associative and has an identity, /„. This is an example of a noncommutative monoid since there are matrices, A and B, for 
which A B +BA. 

(d) [Z„, x„J, n s 2, is a monoid with identity 1. 

(e) Let X be a nonempty set. The set of all functions from X into X, often denoted , is a monoid over function composition. In Chapter 7, 
we saw that function composition is associative. The function i .X ^X defined by i(a) = a is the identity element for this system. This is 
another example of a noncommutative monoid, provided l^l is greater than 1 . 

If X is finite, \X^\ = . For example, if B = |0, 1), |B*| =4. The functions z, u, i, and?, defined by the graphs in Figure 14.1.1, are the 
elements of B^ . This monoid is not a group. Do you know why? 

One reason that B^ is noncommutative is that tz + zt, since (tz) (0) = 1 and (zt) (0) = 0. 
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Figure 14.1.1 

The four elements of 

GENERAL CONCEPTS AND PROPERTIES OF MONOIDS 

Virtually all of the group concepts that were discussed in Chapter 11 are appUcable to monoids. When we introduced subsystems, we saw that a 
submonoid of monoid M is a subset of M —that is, it itself is a monoid with the operation of M. To prove that a subset is a submonoid, you can 

apply the following algorithm. 

Theorem/Algorithm 14.1.1. Let [M; *] be a monoid and K is a nonempty subset of M, K is a submonoid of M if and only if: 

(a) Ifa.beK, then a*b e K {i.e., Kis closed under *), and 

(b) the identity of M belongs to K. 

Often we will want to discuss the smallest submonoid that includes a certain subset 5 of a monoid M. This submonoid can be defined recur- 
sively by the following definition. 

Definition: Submonoid Generated by a Set. If S is a subset of monoid [M; *], the submonoid generated by S, (S), is defined by: 

(a) (Basis) (i) a e S =>a e (S), and(ii) the identity of M belongs to {S); 

(b) (Recursion), a, b £ (S) =^a*b e (S). 

Note: If S = {ai , 02, ...,a„}, we write <ai , 02, . . . , a„> in place of <{ai ,02, . . . , a„}}. 
Example 14.1^. 

(a) In[Z;+], <2> = {0, 2, 4, 6, 8, ...}. 

(b) The power set of Z, f(Z), over union is a monoid with identity 0. If S = {{1}, {2}, {3}}, then <S> is the power set of {1, 2, 3}. If 
S = {{n] : n e Z), then (S) is the set of finite subsets of the integers. 

MONOID ISOMORPHISMS 

Two monoids are isomorphic if and only if there exists a translation rule between them so that any true proposition in one monoid is translated 

to a true proposition in the other. 

Example 14.13. M = [Pll, 2, 3), f]] is isomorphic to M2 = [Z2;-J, where the operation in M2 is componentwise mod2 multiplication. 

A translation rule is that if A c { 1 , 2 , 3 } , then it is translated to (di, d2, dj) where di = i \ ' ^ ' ^ ^ . Two cases of how this translation rule 

yO III 

works are: 

{1, 2, 3} is the identity for Ml, and (1,2) 0(2,3)= (2) 

(1, 1, 1) is the identity for M2, and (1, 1, 0) • (0, 1 , 1) = (0, 1 , 0). 

A more precise definition of a monoid isomorphism is identical to the definition of a group isomorphism (see Section 1 1 .7). 

EXERCISES FOR SECTION 14.1 
A Exercises 

1 . For each of the subsets of the indicated monoid, determine whether the subset is a sub monoid. 

(a) 5i = (0, 2, 4, 6}and52 = |1, 3, 5, 7) in lZs;Xs]. 

(b) {/ e r^'^ : f(n) « n, V„ e 1^) and (/ e I^i^ : /(I) = 2) inl^^. 
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(c) {A c Z : A is finite} and {A c Z : A"^ is finite} in [P(Z); \J]. 

2. For each subset, describe the submonoid that it generates. 

(a) {3}and{0}in[Zi2;Xi2] 

(b) (5)in[Z25;X25j 

(c) the set of prime numbers and {2} in [P; •] 

(d) (3,5}inP; +] 
B Exercises 

3. Definition: Stochastic Matrix. An nxn matrix of real numbers is called stochastic if and only if each entry is nonnegative and the sum of 
entries in each column is 1 . Prove that the set of stochastic matrices is a monoid over matrix multipUcation. 

4. Prove Theorem 14.1.1. 



In this section, we will introduce the concept of a language. Languages are subsets of a certain type of monoid, the free monoid over an 
alphabet. After defining a free monoid, we will discuss languages and some of the basic problems relating to them. We wiU also discuss the 
common ways in which languages are defined. 

Let A be a nonempty set, which we wiU call an alphabet. Our primary interest wiU be in the case where A is finite; however, A could be infinite 
for most of the situations that we wiU describe. The elements of A are called letters or symbols. Among the alphabets that we will use are 
B = (0 , 1 ) , ASCn = the set of ASCII characters , and PAS = the Pascal character set (whichever one you use) . 

Definition: Strings over an Alphabet. A string of length n,n li 1, over A is a sequence of n letters from A ; O; 02 •••''«• T'^^ ""'^ string. A, 
is defined as the string of length zero containing no letters. The set of strings of length n over A is denoted by A". The set of all strings over A. 
is denoted A*. 

Notes: 

(a) If the length of string s is n, we write \s\ = n. 

(b) The null string is not the same as the empty set, although they are similar in many ways. 

(c) A* =A'' U A' U U A' U •••andif j. A' fl A^' = 0; that is, (A°, A', A^, A', ...} is a partition of A*. 

(d) An element of A can appear any number of times in a string. 
Theorem 142.1. If A is countable, then A* is countable. 

Proof: Case 1. Given the alphabet B = {0, 1}, we can define a bijection from the positive integers into B*. Each positive integer has a 

binary expansion df,df,_y ■■■ d\ rfo, where each dj is or 1 and di^ = 1. If n has such a binary expansion, then 2* s « < 2*+'. We define 

/ : P ->B*by/(n) =f(di^di^_i ■■■dido) = ■■■dido, where /(I) = X. Every one of the 2* strings of length k are the images of exactly one 
of the integers between 2* and 2**' - 1 . From its definition, / is clearly a bijection; therefore, B* is countable. 

Case 2: A is Finite. We will describe how this case is handled with an example first and then give the general proof. If A = [a, b, c, d, e), then 
we can code the letters in A into strings from . One of the coding schemes (there are many) is 
a 000, h ^001, c<->010, <i<->011, and e 100. Now every string in A" corresponds to a different string in 6*; for example, ace would 
correspond with 000010 100. The cardinality of A* is equal to the cardinality of the set of strings that can be obtained from this encoding 
system. The possible coded strings must be countable, since they are a subset of a countable set (B*); therefore. A* is countable. 

If |A| = m, then the letters in A can be coded using a set of fixed-length strings from B*. If 2*"' < m ^ 2*, then there are at least as many 
strings of length k in 6* as there are letters in A. Now we can associate each letter in A with an element of B*. Then any string in A* corre- 
sponds to a string in B*. By the same reasoning as in the example above. A* is countable. 

Case 3: A is Countably Infinite. We will leave this case as an exercise. ■ 

FREE MONOIDS OVER AN ALPHABET 

The set of strings over any alphabet is a monoid under concatenation. 

Deflnition: Concatenation. Let a = aia2--- a„ and b = bib2 ■■■ b„ be strings of length m and n, respectively. The concatenation of a with b, 
a o b, is the string of length m + n : aia2 ■■■ a„bib2 ■■■ b„. 

Notes: 

(a) The nuU string is the identity element of [A* ; concatenation] . Henceforth, we wiU denote the monoid of strings over A by A* . 

(b) Concatenation is noncommutative, provided |A| > 1. 




Monoids and Languages 
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(c) If |Ai| = IA2I , then the monoids A] and are isomorphic. An isomorphism can be defined using any bijec- 
tion/ : Ai -> A2.1fa = aia2 ■■■ Un e A|, /*(a) = /(ai)/(a2) ■■■ fio-n) defines a bijection from AJ into AJ. We will leave it to the reader to 
convince him or herself that for all a, e A| , /*(a <> b) = f*(a) <> f*(,b). 
LANGUAGES 

The languages of the world— English, German, Russian, Chinese, and so forth— are called natural languages. In order to communicate in 
writing in any one of them, you must first know the letters of the alphabet and then know how to combine the letters in meaningful ways. A 

formal language is an abstraction of this situation. 

Definition: Formal Language. If A is an alphabet, a formal language over A is a subset of A*. 
Example 14^.1. 

(a) English can be thought of as a language over the set of letters A, B, •••Z (upper and lower case) and other special symbols, such as 
punctuation marks and the blank. Exactly what subset of the strings over this alphabet defines the English language is difficult to pin down 
exactly. This is a characteristic of natural languages that we try to avoid with formal languages. 

(b) The set of all ASCII stream files can be defined in terms of a language over ASCII. An ASCII stream file is a sequence of zero or more 
lines followed by an end-of-file symbol. A line is defined as a sequence of ASCII characters that ends with the two characters CR (carriage 
return) and LF (line feed). The end-of-file symbol is system-dependent; for example, CTRL/C is a common one. 

(c) The set of all syntactically correct expressions in Mathematica is a language over the set of ASCII strings. 

(d) A few languages over B are 

Li = |i e B* I i has exactly as many 1 as ithasO' j), 
L2 = {1 <> J <> : J e B*}, and 

L3 = <0, 01) = the submonoid of B* generated by {0, 01}. 



TWO FUNDAMENTAL PROBLEMS: RECOGNITION AND GENERATION 

The generation and recognition problems are basic to computer programming. Given a language, L, the programmer must know how to write 
(or generate) a syntactically correct program that solves a problem. On the other hand, the compiler must be written to recognize whether a 
program contains any syntax errors. 

The Recognition Problem: Design an algorithm that determines the truth of j e L in a finite number of steps for all a e A*. Any such algo- 
rithm is called a recognition algorithm. 

Definition: Recursive Language. A language is recursive if there exists a recognition algorithm for it. 

Example \A22. 

(a) The language of syntactically correct Mathematica expressions is recursive. 

(b) The three languages in Example 14.2.1 (d) are all recursive. Recognition algorithms for Li and L2 should be easy for you to imagine. 
The reason a recognition algorithm for L3 might not be obvious is that L3 ' s definition is more cryptic. It doesn't tell us what belongs to L3, 
just what can be used to create strings in L3. This is how many languages are defined. With a second description of L3, we can easily design a 
recognition algorithm. L3 = e B*; s = Xor s starts with a and has no consecutive I's}. 

Algorithm 14.2.1: Recognition Algorithm for L3. Let j = Ji J2 £ B'. This algorithm determines the truth value of j e L3. The truth value is 
returned as the value of Word. 

(1) Word := true 

(2) If n >Othen 

If Ji = 1 then Word := false 
else for i := 3 to « 

if Si-i = 1 and 5; = 1 then Word := false 

The Gieneration Problem. Design an algorithm that generates or produces any string in L. Here we presume that A is either finite or countably 
infinite; hence. A* is countable by Theorem 14.2.1 , and L c A* must be countable. Therefore, the generation of L amounts to creating a list of 
strings in L. The list may be either finite or infinite, and you must be able to show that every string in L appears somewhere in the list. 

Theorem 14.2.2. 

(a) If A is countable, then there exists a generating algorithm for A*. 

(b) IfL is a recursive language over a countable alphabet, then there exists a generating algorithm for L. 
Proof: 

(a) Part a follows from the fact that A* is countable; therefore, there exists a complete list of strings in A*. 

(b) To generate all strings of L, start with a list of all strings in A* and an empty list, W, of strings in L. For each string s, use a recognition 
algorithm (one exists since L is recursive) to determine whether j e L. If j is in L, add it to W; otherwise "throw it out." Then go to the next 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed imder a Creative Commons Attribution-Noncommercial-ShareAlike 3.0 United States License. 



348 



Chapter 14 - Monoids and Automata 



string in the list of A*. ■ 

Example 14^3. Since all of the languages in Example 14.2.2 are recursive, they must have generating algorithms. The one given in the proof 
of Theorem 14.2.2 is not generally the most efficient. You could probably design more efficient generating algorithms for L2 and L3; however, 
a better generating algorithm for Li is not quite so obvious. 

The recognition and generation problems can vary in difficulty depending on how a language is defined and what sort of algorithms we allow 
ourselves to use. This is not to say that the means by which a language is defined determines whether it is recursive. It just means that the truth 
of "L is recursive" may be more difficult to determine with one definition than with another. We will close this section with a discussion of 
grammars, which are standard forms of definition for a language. When we restrict ourselves to only certain types of algorithms, we can affect 
our ability to determine whether i e L is true. In defining a recursive language, we do not restrict ourselves in any way in regard to the type of 
algorithm that wiU be used. In Section 14.3, we wiU consider machines called ^n;fe automata, which can only perform simple algorithms. 

PHRASE STRUCTURE GRAMMARS AND LANGUAGES 

One common way of defining a language is by means of a phrase structure grammar (or grammar, for short). The set of strings that can be 
produced using the grammar rules is called the phrase structure language (of the grammar). 

Example 14.2.4. We can define the set of all strings over B for which all Os precede all Is as follows. Define the starting symbol S and establish 
rules that S can be replaced with any of the following: A, OS, or SI. These replacement rules are usually called production (or rewriting) rules 
and are usually written in the format S -» A, 5 -» 5, and S -> S 1 . Now define L to be the set of all strings that can be produced by starting with 
S and applying the production rules until S no longer appears. The strings in L are exactly the ones that are described above. 

Definition: Phrase Structure Grammar. A phrase structure grammar consists of four components: 

(1) A nonempty finite set of terminal characters, T. If the grammar is defining a language over A,Tis a subset of A* . 

(2) A finite set of nonterminal characters, N. 

(3) A starting symbol, S eN. 

(4) A finite set of production rules, each of the form X -*Y, where Xand Y are strings over A[J N such that X and X contains at least 
one nonterminal symbol. 

If G is a phrase structure grammar, L(C) is the set of strings that can be obtained by starting with S and applying the production rules a finite 
number of times until no nonterminal characters remain. If a language can be defined by a phrase structure grammar, then it is called a phrase 
structure language. 

Example 14.2.5. The language over B consisting of strings of alternating Os and Is is a phrase structure language. It can be defined by the 
following grammar: 

(1) Terminal characters: A, 0, and 1, 

(2) Nonterminal characters: S, T, and U, 

(3) Starting symbol: S, 

(4) Production rules: 

S^T, S^U, S^X, S^O, 5^1, 5^or, 
S^IU, T->WT, T^IO, U^OIU, U-^Ol 

These rules can be visualized more easily with a graph: 
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Figure 14.2.1 

Production rules for the language of alternating O's and 1 's. 

We can verify that a string such as 10101 belongs to the language by starting with S and producing 10101 using the production rules a finite 
numberof times: 5 ^ 1 [/^ 101 (/^ 10101. 

Example 14.2.6. Let G be the grammar with components: 

(1) Terminal symbols = all letters of the alphabet (both upper and lower case) and the digits through 9, 

(2) Nonterminal symbols ={/, X}, 

(3) Starting symbol: / 

(4) Production rules: I ^ a, where a is any letter, / -» aX for any letter a,X ^ fSX. for any letter or digit /?, and X -> jS for any letter or digit 
There are a total of 176 production rules for this grammar. The language L (G) consists of all valid Mathematica names. 

Backus-Naur form (BNF), A popular alternate form of defining the production rules in a grammar is BNF. If the production rules 
A -> B] , A -> B2. ■■■ ^ -> fi« are part of a grammar, they would be written in BNF as A :: = B\ IB2I ■ ■ ■ \B„. The symbol | in BNF is read as "or," 
while the :: = is read as "is defined as." Additional notations of BNF are that [x], represents zero or more repetitions of x and \y\ means that y is 
optional . 

Example 14.2.7. A BNF version of the production rules for a Mathematica name is 

letter :: = a\b\c--- |z | A |B | ••• |Z 
digit::=0| 1 I ••• |9 
/ :: = letter {letter | digit) 

Example 14.2.8. An arithmetic expression can be defined in BNF. For simplicity, we will consider only expressions obtained using 
addition and multiplication of integers. The terminal symbols are (,),+,*, -, and the digits through 9. The nonterminal symbols are E (for 
expression), T (term), F (factor), and A' (number). The starting symbol is E. 

E ■.■.=E + T\T 
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T ■. : = T * F \ F 
F :: = (£) I Af 
N :: = [-]digit{digit}. 

One particularly simple type of phrase structure grammar is the regular grammar. 

Definition: Regular Grammar. A regular (right-hand form) grammar is a grammar whose production rules are all of the form A^t and 
A tB, where A and B are nonterminal and t is terminal. A left-hand form grammar allows only A-^t and A-^Bt, A language that has a 
regular phrase structure language is called a regular language. 

Example 14^.9. 

(a) The set of Mathematica names is a regular language since the grammar by which we defined the set is a regular grammar. 

(b) The language of all strings for which all Os precede all Is (Example 14.2.4) is regular; however, the grammar by which we defined this 
set is not regular. Can you define these strings with a regular grammar? 

(c) The language of arithmetic expressions is not regular. 
EXERCISES FOR SECTION 14.2 

A Exercises 

1. (a) If a computer is being designed to operate with a character set of 350 symbols, how many bits must be reserved for each character? 
Assume each character will use the same number of bits. 

(b) Do the same for 3,500 symbols. 

2. It was pointed out in the text that the null string and the null set are different. The former is a string and the latter is a set, two different kinds 

of objects. Discuss how the two are similar. 

3. What sets of strings are defined by the following grammar? 

(a) Terminal symbols: A, and 1 

(b) Nonterminal symbols: S and E 

(c) Starting symbol: S 

(d) Production rules: 5^080, 5^181, S^E, E ^ X, E^O, E^l. 

4. What sets of strings are defined by the following grammar? 

(a) Terminal symbols: X, a, b, and c 

(b) Nonterminal symbols: S, T, V and E 

(c) Starting symbol: S 

(d) Production rules: S^aS, S^T, T^bT, T ^U, U^cU, U^E, E^X. 

5. Define the following languages over B with phrase structure grammars. 
Which of these languages are regular? 

(a) The strings with an odd number of characters. 

(b) The strings of length 4 or less. 

(c) The palindromes, strings that are the same backwards as forwards. 

6. Define the following languages over B with phrase structure grammars. Which of these languages are regular? 

(a) The strings with more Os than Is. 

(b) Thestrings with an even number of Is. 

(c) The strings for which all Os precede all Is. 

7. Prove that if a language over A is recursive, then its complement is also recursive. 

8. Use BNF to define the grammars in Exercises 3 and 4. 
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B Exercise 

CO 

9. (a)Prove that H Xi, X2, ... is a countable sequence of countable sets, the union of these sets, U is countable. 

(b) Using the fact that the countable union of countable sets is countable, prove that if A is countable, then A* is countable. 

1 14.3 Automata, Finite-State Machines 

In this section, we will introduce the concept of an abstract machine. The machines we will examine will (in theory) be capable of performing 
many of the tasks associated with digital computers. One such task is solving the recognition problem for a language. We wiU concentrate on 
one class of machines, finite-state machines (finite automata). And we will see that they are precisely the machines that are capable of recogniz- 
ing strings in a regular grammar. 

Given an alphabet X, we will imagine a string in X* to be encoded on a tape that we will call an input tape. When we refer to a tape, we 
might imagine a strip of material that is divided into segments, each of which can contain either a letter or a blank. 

The typical abstract machine includes an input device, the read head, which is capable of reading the symbol from the segment of the 
input tape that is currently in the read head. Some more advanced machines have a read/write head that can also write symbols onto the tape. 

The movement of the input tape after reading a symbol depends on the machine. With a finite-state machine, the next segment of the input tape 
is always moved into the read head after a symbol has been read. Most machines (including finite-state machines) also have a separate output 
tape that is written on with a write head. The output symbols come from an output alphabet, Z, that may or may not be equal to the input 
alphabet. The most significant component of an abstract machine is its memory structure. This structure can range from a finite number of bits 
of memory (as in a finite-state machine) to an iirfinite amount of memory that can be sorted in the form of a tape that can be read from and 
written on (as in a Turing machine). 

Definition: Finite-State Machine. A finite-state machine is defined by a quintet (S, X, Z, w, t) where 

(1) S = {sj, S2, ... , Sr] is the state set, a finite set that corresponds to the set of memory configurations that the machines can have at any 

time. 

(2) X = [xj,X2, x^] is the input alphabet. 

(3) Z = lZ],Z2, Zn] is the output alphabet. 

(4) w : X xS -» Z is the output function, which specifies which output symbol w(x, s) e Z is written onto the output tape when the machine is 
in state s and the input symbol x is read. 

(5) t : X xS S is the next-state ( or transition) function, which specifies which statetix, s) e S the machine should enter when it is in state s 
and it reads the symbol x. 

Example 143.1. Many mechanical devices, such as simple vending machines, can be thought of as finite-state machines. For simpUcity, 
assume that a vending machine dispenses packets of gum, spearmint (S), peppermint (P), and bubble (B), for 250 each. We can define the input 
alphabet to be (deposit 25 0, press 5, press/", press B| and the state set to be {Locked, Select), where the deposit of a quarter unlocks the 
release mechanism of the machine and allows you to select a flavor of gum. We will leave it to the reader to imagine what the output alphabet, 
output function, and next-state function would be. You are also invited to let your imagination run wild and include such features as a coin- 
return lever and change maker. 

Example 143.2. The following machine is called a parity checker. It recognizes whether or not a string in B* contains an even number of Is. 
The memory structure of this machine reflects the fact that in order to check the parity of a string, we need only keep track of whether an odd or 
even number of Is has been detected. 

(1) The input alphabet is B = {0, 1). 

(2) The output alphabet is also B. 

(3) The state set is {even, odd}. 

(4, 5) The following table defines the output and next-state functions: 



X 


s 


w(x, s) 


t(x, s) 





even 





even 





odd 


1 


odd 


1 


even 


1 


odd 


1 


odd 





even 



Note how the value of the most recent output at any time is an indication of the current state of the machine. Therefore, if we start in the even 
state and read any finite input tape, the last output corresponds to the final state of the parity checker and tells us the parity of the string on the 
input tape. For example, if the string 1 1001010 is read from left to right, the output tape, also from left to right, will be 10001 100. Since the last 

character is a 0, we know that the input string has even parity. 

An alternate method for defining a finite-state machine is with a transition diagram. A transition diagram is a directed graph that contains a 
node for each state and edges that indicate the transition and output functions. An edge (j,, sj) that is labeled x/z indicates that in state j, the 
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input X results in an output of z and the next state is sj. That is, w(x, Sj) = z and f(x, i,) = sj. The transition diagram for the parity checker 
appears in Figure 14.3.1. In later examples, we will see that if different inputs, x; and xj, while in the same state, result in the same transitions 
and outputs, we label a single edge Xj, Xjj z instead of drawing two edges with labels x;lz and Xjj z- 

One of the most significant features of a finite-state machine is that it retains no information about its past states that can be accessed by the 
machine itself. For example, after we input a tape encoded with the symbols 01101010 into the parity checker, the current state will be even, 
but we have no indication within the machine whether or not it has always been in even state. Note how the output tape is not considered part of 
the machine's memory. In this case, the output tape does contain a "history" of the parity checker's past states. We assume that the finite-state 
machine has no way of recovering the output sequence for later use. 




Figure 14.3.1 
Transition Diagram for a parity checker 

Example 143.3. Consider the following simplified version of the game of baseball. To be precise, this machine describes one half-inning of a 
simplified baseball game. Suppose that in addition to home plate, there is only one base instead of the usual three bases. Also, assume that there 
are only two outs per inning instead of the usual three. Our input alphabet will consist of the types of hits that the batter could have: out (O), 
double play (DP), single (S), and home run (HR). The input DP is meant to represent a batted ball that would result in a double play (two outs), 
if possible. The input DP can then occur at any time. The output alphabet is the numbers 0,1, and 2 for the number of runs that can be scored as 
a result of any input. The state set contains the current situation in the inning, the number of outs, and whether a base runner is currently on the 
base. The list of possible states is then 00 (for outs and runners), 01 , 10, 11, and end (when the half-inning is over). The transition diagram 
for this machine appears in Figure 14.3.2. 

Let's concentrate on one state. If the current state is 01, outs and 1 runner on base, each input results in a different combination of output and 
next-state. If the batter hits the ball poorly (a double play) the output is zero runs and the inning is over (the limit of two outs has been made). A 
simple out also results in an output of runs and the next state is 11, one out and one runner on base. If the batter hits a single, one run scores 
(output =1) while the state remains 01 . If a home run is hit, two runs are scored (output = 2) and the next state is 00. If we had allowed three 
outs per inning, this graph would only be marginally more complicated. The usual game with three bases would be quite a bit more compli- 
cated, however. 




Figure 14.3.2 

Transition Diagram for a simplified game of baseball 



RECOGNITION IN REGULAR LANGUAGES 

As we mentioned at the outset of this section, finite-state machines can recognize strings in a regular language. Consider the language L over 
[a, b, c] that contains the strings of positive length in which each a is followed by b and each b is followed by c. One such string is bccabcbc. 
This language is regular. A grammar for the language would be nonterminal symbols [A, B, C] with starting symbol C and production rules 
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A^bB, cC, C aA, C ^ cC and C -> c. A finite-state machine (Figure 14.3.3) that recognizes this language can be constructed with one 
state for each nonterminal symbol and an additional state (Reject) that is entered if any invalid production takes place. At the end of an input 
tape that encodes a string in [a, b, c\*, we will know when the string belongs to L based on the final output. If the final output is 1, the string 
belongs to L and if it is 0, the string does not belong to L. In addition, recognition can be accomplished by examining the final state of the 
machine. The input string belongs to the language if and only if the final state is C. 

The construction of this machine is quite easy: note how each production rule translates into an edge between states other than Reject. 
For example, C -> foB indicates that in State C, an input of b places the machine into State B. Not all sets of production rules can be as easily 
translated to a finite-state machine. Another set of production rules for L is A -> aB, B bC, C -> cA, C ^ cB, C ^ cC and C -> c. Tech- 
niques for constructing finite-state machines from production rules is not our objective here. Hence we will only expect you to experiment with 
production rules until appropriate ones are found. 




Figure 14.3.3 

Example 14.3.4. A finite-state machine can be designed to add positive integers of any size. Given two integers in binary form, 
and b = b„ b„-i ■■■ bibo,the machine will read the input sequence, which is obtained from the digits of a and b reading from 

right to left, 

ao bo(ao +2bo) , ... , a„ b„(a„ +2 b„), 

followed by the special input 111. Note how all possible inputs except the last one must even parity (contain an even number of ones). The 
output sequence is the sum of a and b, starting with the units digit, and comes from the set {0, 1, A). The transition diagram for this machine 
appears in Figure 14.3.4. 



Start 




Figure 14.3.4 
Transition Diagram for a binary adder 

EXERCISES FOR SECTION 14.3 

A Exercises 

1 . Draw a transition diagram for the vending machine described in Example 
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14.3.1. 

2. Construct finite-state machines that recognize the regular languages that you identified in Section 14.2. 

3. What is the input set for the machine in Example 14.3.4? 

4. What input sequence would be used to compute the sum of 1 101 and 0111 (binary integers)? What would the output sequence be? 
B Exercise 

5. The Gray Code Decoder. The finite-state machine defined by the following figure has an interesting connection with the Gray Code 
(Section 9.4). 




Figure 14.3.5 
Gray Code Decoder 

Given a string x = X] X2 ■■■ x„ e B" , we may ask where x appears in G„. Starting in Copy state, the input string x will result in an output string 
zeB", which is the binary form of the position of x in G„ Positions are numbered from to 2" - 1 . 

(a) In what positions (0-31) do 101 10, 00100, and 1 1 1 1 1 appear in G5? 

(b) Prove that the Gray Code Decoder always works. 

I 14.4 The Monoid of a Finite-State Machine 

In this section, we will see how every finite-state machine has a monoid associated with it. For any finite-state machine, the elements of its 
associated monoid correspond to certain input sequences. Because only a finite number of combinations of states and inputs is possible for a 
finite-state machine there is only a finite number of input sequences that summarize the machine This idea is illustrated best with a few 
examples . 

Example 14.4.1. Consider the parity checker. The following table summarizes the effect on the parity checker of strings in B' and B^ . 
The row labeled "Even" contains the final state and final output as a result of each input string in B' and B^ when the machine starts in the 
even state. Similarly, the row labeled "Odd" contains the same information for input sequences when the machine starts in the odd state. 



Input String 





1 


00 


01 


10 


11 


Even 


(Even, 0) 


(Odd, 1) 


(Even, 0) 


(Odd, 1) 


(Odd, 1) 


(Even, 0) 


Odd 


(Odd, 1) 


(Even, 1) 


(Odd, 1) 


(Even, 1) 


(Even, 0) 


(Odd, 1) 


Same Effect as 









1 


1 






Note how, as indicated in the last row, the strings in B- have the same effect as certain strings in B' . For this reason, we can summarize the 
machine in terms of how it is affected by strings of length 1. The actual monoid that we will now describe consists of a set of functions, and 
the operation on the functions will be based on the concatenation operation. 

Let To be the final effect (state and output) on the parity checker of the input 0. Similarly, Ti is defined as the final effect on the parity 
checker of the input 1 . More precisely, 

7()(even) = (even, 0) and 7()(odd) = (odd, 1) , 

while 

7"i(even) = (odd, 1) and ri(odd) = (even, 0). 

In general, we define the operation on a set of such functions as follows: if i, f are input sequences and T, and 7",, are functions as above, then 
T^-iTt = Tst, that is, the result of the function that summarizes the effect on the machine by the concatenation of s with t. Since, for example, 
01 has the same effect on the parity checker as l,To*Ti = Tqi = Ti. We don't stop our calculation at 7bi because we want to use the shortest 
string of inputs to describe the final result. A complete table for the monoid of the parity checker is 





To 


Ti 


To 


To 


Ti 


Ti 


Ti 


To 



What is the identify of this monoid? The monoid of the parity checker is isomorphic to the monoid [Z2, -1-2] • 

This operation may remind you of the composition operation on functions, but there are two principal differences. The domain of 7",. is not the 
codomain of T, and the functions are read from left to right unlike in composition, where they are normally read from right to left. 

You may have noticed that the output of the parity checker echoes the state of the machine and that we could have looked only at the effect 
on the machine as the final state. The following example has the same property, hence we will only consider the final state. 

Example 14.4.2. The transition diagram for the machine that recognizes strings in B* that have no consecutive I's appears in Figure 
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14.4.1. Note how it is similar to the graph in Figure 9.1.1. Only a "reject state" has been added, for the case when an input of 1 occurs while 
in State a. We construct a similar table to the one in the previous example to study the effect of certain strings on this machine. This time, we 
must include strings of length 3 before we recognize that no "new effects" can be found. 



a 




Figure 14.4.1 



Inputs 





1 


00 


01 


10 


11 


000 


001 


010 


Oil 


100 


101 


110 


111 


s 


b 


a 


b 


a 


b 


r 


b 


a 


b 


r 


b 


a 


r 


r 


a 


b 


r 


b 


a 


r 


r 


b 


a 


b 


r 


r 


r 


r 


r 


b 


b 


a 


b 


a 


b 


r 


b 


a 


b 


r 


b 


a 


r 


r 


r 


r 


r 


r 


r 


r 


r 


r 


r 


r 


r 


r 


r 


r 


r 


Same as 


















01 





11 


10 


1 


11 


11 



The following table summarizes how combinations of the strings 0, 1,01, 10, and 1 1 affect this machine. 





To 


Ti 


Toi 


Tio 


Tn 




To 


Ti 


Toi 


Tw 


Tu 


Ti 


Tio 


Tn 


Ti 


Tn 


Tu 


Tm 


To 


Tu 


Toi 


Tn 


Tu 


Tio 


Tm 


T, 


Ti 


Tm 


Tu 


Tu 


Tu 


Tn 


Tu 


Tn 


Tu 



All the results in this table can be obtained using the previous table. For example, 

Tm * Toi = T'looi = T'loo * = Tm *Ti= Tm\ = Ti 
and 

Toi*To\ = Toioi = Tom Ti =ToTi = Tqi . 



Note that none of the elements that we have listed in this table serves as the identity for our operation. This problem can always be remedied 
by including the function that corresponds to the input of the null string, 7"^. Since the null string is the identity for concatenation of strings, 
= T•^ = T, for all input strings s. 

Example 14.4.3. A finite-state machine called the unit-time delay machine does not echo its current state, but prints its previous state. 
For this reason, when we find the monoid of the unit- time delay machine, we must consider both state and output. The transition diagram of 
this machine appears in Figure 14.4.2. 




Figure 14.4.2 
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100 


101 


110 


111 


Input 





1 


00 


01 


10 


11 


or 


or 


or 


or 
















000 


001 


101 


Oil 





(0,0) 


(1,0) 


(0,0) 


(1,0) 


(0, 1) 


(1, 1) 


(0,0) 


(1,0) 


(0, 1) 


(1, 1) 


1 


(0, 1) 


(1, 1) 


(0,0) 


(1,0) 


(0, 1) 


(1, 1) 


(0,0) 


(1,0) 


(0, 1) 


(1, 1) 


Same as 














00 


01 


10 


11 



Again, since no new outcomes were obtained from strings of length 3, only strings of length 2 or less contribute to the monoid of the 
machine. The table for the strings of positive length shows that we must add ^^ to obtain a monoid. 





To 


Ti 


Too 


Toi 


Tio 


Tn 


To 


Too 


Toi 


Too 


Toi 


Tio 


Tn 


T, 


Tm 


Tn 


Too 


Toi 


Tio 


Tn 


Too 


Too 


Toi 


Too 


Toi 


Tio 


Tn 


Toi 


Tm 


Tn 


Too 


Toi 


Tio 


Tn 


Tm 


Too 


Toi 


Too 


Toi 


Tio 


Tn 


Tn 


Tio 


Til 


Too 


Toi 


Tio 


Tn 



EXERCISES FOR SECTION 14.4 



A Exercise 

1 . For each of the transition diagrams in Figure 14.4.3, write out tables for their associated monoids. Identify the identity in terms of a string of 
positive length, if possible. {Hint: Where the output echoes the current state, the output can be ignored.) 

(b) 




D 




B 




~~ " 1 





/\ 



a,b,c 



\/ 

c 



Figure 14.4.3 



B Exercise 

2. What common monoids are isomorphic to the monoids obtained in the previous exercise? 

Exercise 

3. Can two finite-state machines with nonisomorphic transition diagrams have isomorphic monoids? 



1 14.5 The Machine of a Monoid 



Any finite monoid [M, *] can be represented in the form of a finite-state machine with input and state sets equal to M. The output of the 
machine will be ignored here, since it would echo the current state of the machine. Machines of this type are called state machines. It can be 
shown that whatever can be done with a finite-state machine can be done with a state machine; however, there is a trade-off. Usually, state 
machines that perform a specific function are more complex than general finite-state machines. 

Definition: Machine of a Monoid. If[M, *] is a finite monoid, then the machine of M, denoted m(M), is the state machine with state set M, 
input set M, and next-state function t : MxM -> M defined by t (s , x) = s * x. 

Example 14.5.1. We will construct the machine of the monoid [Z2; -1-2]. As mentioned above, the state set and the input set are both Z2. 
The next state function is defined by t (s, x) = ^ -1-2 x. The transition diagram for m(Z.2) appears in Figure 14.5.1 . Note how it is identical to 
the transition diagram of the parity checker, which has an associated monoid that was isomorphic to [Z2, +2]- 
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Figure 14.5.1 



Example 14.5.2. The transition diagram of the monoids [Z2, X2] and [Z3, X3] appear in Figure 14.5.2. 

(b) 




Figure 14.5.2 

Example 14.5.3. Let U be the monoid that we obtained from the unit-time delay machine (Example 14.4.3). We have seen that the 
machine of the monoid of the parity checker is essentially the parity checker. Will we obtain a unit-time delay machine when we construct the 
machine of (7? We can't expect to get exactly the same machine because the unit-time delay machine is not a state machine and the machine 
of a monoid is a state machine. However, we will see that our new machine is capable of telling us what input was received in the previous 
time period. The operation table for the monoid serves as a table to define the transition function for the machine. The row headings are the 
state values, while the column headings are the inputs. If we were to draw a transition diagram with all possible inputs, the diagram would be 
too difficult to read. Since U is generated by the two elements, Tq and Ti, we will include only those inputs. Suppose that we wanted to read 
the transition function for the input Tqi. Since Tqi =TqTi, in any state s, t(s, 7oi) = t(l(s, Tq), Ti). The transition diagram appears in 
Figure 14.5.3. 




Figure 14.5.3 

If we start reading a string of Os and Is while in state 7\ and are in state T^i, at any one time, the input from the previous time period (not the 
input that sent us into T^^,, the one before that) is a. In states T^, Tq and Ti , no previous input exists. 

EXERCISES FOR SECTION 14.5 
A Exercise 

1 . Draw the transition diagrams for the machines of the following monoids: 
(a) 

(b) The direct product of [Z2; X2] with itself. 
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B Exercise 

2. Even though a monoid may be infinite, we can visualize it as an infinite-state machine provided that it is generated by a finite number of 
elements. For example, the monoid B* is generated by and 1. A section of its transition diagram can be obtained by allowing input only from 
the generating set (Figure 14.5.4a). The monoid of integers under addition is generated by the set {-1, 1}. The transition diagram for this 
monoid can be visualized by drawing a small portion of it, as in Figure 14.5.4b. 




Figure 14.5.4 



(a) Draw a transition diagram for {a, b, c}*. 

(b) Draw a transition diagram for [Z x Z , componentwise addition] . 
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I SUPPLEMENTARY EXERCISES FOR CHAPTER 14 

Section 14,1 

1. Let fi be a Boolean algebra and M the set of all Boolean functions on B. Let * be defined on M by (/ * g) (a) = f(a) A Prove that 
[M, *] is a monoid. Construct the operation table of [M , *] for the case of B = 62- 

2. A semigroup is an algebraic system [S, ■*] with the only axiom that * be associative on S. Prove that if 5 is a finite set, then there must exist 
an idempotent element, that is, an a e S such that a * a = a. 

Section 14.2 

3. What language does the following grammar define? The start symbol is S, the terminal symbols are a and b, and the nonterminal symbols 
are 5 and B. The production rules are 5 a, S ^ b B, B ^ b, B ^ b S. 

4. What language does the following grammar define? Start symbol = S. nonterminal symbols: T, R. Production rules: S ^ T , S ^ R, 
T -> bR,R -^aT,T ^ b,R ^ a. 

5 . Write a regular grammar for the language L over the alphabet {a, b) where L is the set of all strings with exactly two b'&. 

6. Write a regular grammar to describe the strings of O's and I's that consist of a positive number of O's surrounded by single I's. For 
example, 100001 is one such string. 

Section 14.3 

7. Draw a finite-state machine to recognize the language in Exercise 5. Have the last output be 1 if the input word is in L, and it is not in L. 

8. Draw a transition diagram for a finite-state machine that recognizes strings in the language of Exercise 6. 

9. A finite-state machine moves once every time unit between one of four states called Right, Middle, Left, and Down. The input alphabet is 
X = (00, 01, 10, 11) and the output alphabet is y = (1, 0). 

Lett Middle Right 

I 

Down 

(i) If the machine is in the Middle, Right, or Left, 00 means that it stays where it is; 01 means that it moves one state to the right (e.g. 
Left to Middle.)— if it can't move any farther to the right, it stays where it is; 

10 means that it moves one state to the left. 

(ii) Input of 1 1 means that the machine stays where it is except if it is in the Middle, in which case it enters the Down state. 

(iii) If the machine is in the Down state, it stays in that state forever. 

(iv) Output is 1 if the state of the machine changes, otherwise. 

(a) Construct the transition diagram for this finite-state machine. 

(b) If s (0) = Middle and .s(3) = Down, list the possible output sequences that could have occurred for ? = 0, 1, 2. 
Section 14.4 

10. Write out the operation table for the monoid of the machine in Exercise 9. Section 14.5 

1 1 . Draw the transition diagram of the machine of [M , *] in Exercise 1 of these supplementary exercises. 
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Chapter 15 




GROUP THEORY AND APPLICATIONS 



Abelian Group 

In Abelian groups, when computing, 
With operands there's no refuting: 
The expression be 
Is the same as cb. 

Not en route to your job, yet commuting. 

- limerick by Howard Spindel in The Omnificent English Dictionary In Limerick Form 

GOALS 

In Chapter 11, groups were introduced as a typical algebraic system. The associated concepts of subgroup, group isomorphism, and direct 
products of groups were also introduced. Groups were chosen for that chapter because they are among the simplest types of algebraic systems. 
Despite this simplicity, group theory abounds with interesting applications, many of which are of interest to the computer scientist. In this 
chapter we will introduce some more important concepts in elementary group theory, and some of their applications. 
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1 15.1 Cyclic Groups 

Groups are classified according to their size and structure. A group's structure is revealed by a study of its subgroups and other properties (e.g., 
whether it is abelian) that might give an overview of it. Cyclic groups have the simplest structure of all groups. 

Definitions: Cyclic Group, Generator. Group G is cyclic if there exists a e G such that the cyclic subgroup generated hy a, (a), equals 
all of G. That is, G = [na \ n e Z), in which case a is called a generator of G. The reader should note that additive notation is used for G. 

Example 15.1.1. Z12 = V^\2 , +12 J> where +12 is addition modulo 12, is a cyclic group. To verify this statement, all we need to do is 
demonstrate that some element of Z12 is a generator. One such element is 5; that is, (5) = Z12 . One more obvious generator is 1. In fact, 1 is 
a generator of every [Z„; +„]. The reader is asked to prove that if an element is a generator, then its inverse is also a generator. Thus, -5=7 
and -1 = 1 1 are the other generators of Z12. 




Figure 15.1.1 
Examples of "string art" 

Figure 15.1.1(a) is an example of "string art" that illustrates how 5 generates Z12. Twelve tacks are placed along a circle and numbered. A 
string is tied to tack 0, and is then looped around every fifth tack. As a result, the numbers of the tacks that are reached are exactly the ordered 
multiples of 5 modulo 12: 5, 10, 3, ... , 7, 0. Note that if every seventh tack were used, the same artwork would be obtained. If every third 
tack were connected, as in Figure 15.1 .1(b), the resulting loop would only use four tacks; thus 3 does not generate Z12 . 

Example 15.1.2. The group of additive integers, [Z; +], is cyclic: 
Z = (1) = {n-1 |n e Z). 
This observation does not mean that every integer is the product of an integer times 1 . It means that 

n terms n terms 

Z = {0} u{l + l"+^- + l I nelP}u{(-l) + (-l)T---+(-l) I neP} 
Theorem 15.1.1. If [G *] is cyclic, then it is abelian. 

Proof; Let a be any generator of G and let b, c & G. By the definition of the generator of a group, there exists integers m and n such that 
b = TO a and c = n a. Thus 

b*c = (ma)*(na) 

= (m +n)a by Theorem 1 1 .3.7(ii) 

= {n +m)a 

= (na)*(nb) 
= c*b ■ 

One of the first steps in proving a property of cyclic groups is to use the fact that there exists a generator. Then every element of the group can 
be expressed as some multiple of the generator. Take special note of how this is used in theorems of this section. 

Up to now we have used only additive notation to discuss cyclic groups. Theorem 15.1.1 actually justifies this practice since it is customary to 
use additive notation when discussing abelian groups. Of course, some concrete groups for which we employ multiplicative notation are cyclic. 
If one of its elements, a, is a generator, 

(a) = (a" I n e Z) 

Example 15.13. The group of positive integers modulo 11 with modulo 11 multiplication, [Zn* ;Xii J, is cyclic. One of its generators is 
6: 6' = 6,6^ = 3,6^= 7,... , 6' = 2, and 6 = 1, the identity of the group. 
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Example 15.14. The real numbers with addition, [R; +] is a noncyclic group. The proof of this statement requires a bit more generaUty 
since we are saying that for all r e R, (r) is a proper subset of R. If r is nonzero, the multiples of r are distributed over the real line, as in 
Figure 15.1.2. It is clear then that there are many real numbers, like r/2, that are not in (r). 

-3r -Ir -Ir Or Ir 2r 3r 



Figure 15.1.2 
Elements of (r), r > 

The following theorem shows that a cyclic group can never be very complicated. 

Theorem 15.12. If G is a cyclic group, then G is either finite or countably infinite. IfG is finite and \G\ = n, it is isomorphic to [Z„, +„]. 
IfG is infinite, it is isomorphic to [Z, +]. 

Proof: Case 1: |G| < cxi. If a is a generator of G and |G| = n, define : Z„ -> G by 
(pik) = ka for all k &J-n 

Since (a) is finite, we can use the fact that the elements of (a) are the first n nonnegative multiples of a. From this observation, we see that ^ 
is a surjection. A surjection between finite sets of the same cardinaUty must be a bijection. Finally, if ^ e Z„, 

<^(p) + 0(q) = pa + qa 

= (p + q)a 

= ip+n<l)(i see exercise 10 
= 4>(P +n?) 
Therefore is an isomorphism. 

Case 2; |G| = oo. We will leave this case as an exercise. ■ 

The proof of the next theorem makes use of the division property for integers, which was introduced in Section 11.4: If m, n are integers, 
m > 0, there exist unique integers q (quotient) and r (remainder) such that n = qm + r and < r < m. 

Theorem 15.13. Every subgroup of a cyclic group is cyclic. 

Proof: Let G be cyclic with generator a and let H < G.\i H = [e], H has e as a generator. We may now assume that \H\ > 2 and ate. 
Let m be the least positive integer such that m a belongs to H. (This is the key step. It lets us get our hands on a generator of H) We will now 
show that c = ma generates H. Suppose that (c) H. Then there exists h e H such that b ^ (c). Now, since is in G, there exists n e Z 
such that b = na. We now apply the division property and divide n by m. 

b = na = {qm + r)a = {qm)a + r a, 
whereO <r<m. We note that r caimot be zero for otherwise we would have b = na = qima) = qc e (c). Therefore, 

ra = na - {qm)a e H 

This contradicts our choice of m because < r < m. ■ 

Example 15.1.5. The only proper subgroups of Zio are Hi = {0, 5} and H2 = {0, 2, 4, 6, 8}. They are both cycUc: Hi = (5), while 
H2 = (2) = (4) = (6) = (8). The generators of Zio are 1,3, 7, and 9. 

Example 15.1 .6. With the exception of |0), all subgroups of Z are isomorphic to Z. If // < Z, then H is the cycUc subgroup generated 
by the least positive element of H. It is infinite and so by theorem 15.1 .2 it is isomorphic to Z. 

We now cite a useful theorem for computing the order of cycUc subgroups of a cycUc group: 

Theorem 15.1.4. If G is a cyclic group of order n and a is a generator ofG, the order ofka is n/d, where d is the greatest common 
divisor of n and k. 

The proof of this theorem is left to the reader. 

Example 15.1 .7. To compute the order of (18) in Z30, we first observe that 1 is a generator of Z30 and 18 = 18(1). The greatest common 
divisor of 18 and 30 is 6. Hence, the order of (18) is 30/6, or 5. 

APPLICATION: FAST ADDERS 

At this point, we will introduce the idea of a fast adder, a relatively modem appUcation (Winograd, 1965) to an ancient theorem, the Chinese 
Remainder Theorem. We wiU present only an overview of the theory and rely primarily on examples. The interested reader can refer to 
Domhoff and Hohn for details. 

Out of necessity, integer addition with a computer is addition modulo n, for n some larger number. Consider the case where n is small, like 64. 
Then addition involves the addition of six-digit binary numbers. Consider the process of adding 31 and 1. Assume the computer's adder takes 
as input two bit strings a = {ao, a\, 02, 03, 04, a^] and b = {bo, b\, 62, &4, ^5} and outputs s = {so, si, S2, S3, S4, s^}, the sum of a and b. 
Then, if a = 31 = (1, 1, 1, 1, 1, 0) and 6 = 1 = (1, 0, 0, 0, 0, 0), s will be (0, 0, 0, 0, 0, 1), or 32. The output J5 = 1 cannot be deter- 
mined until all other outputs have been determined. If addition is done with a finite-state machine, as in Example 14.3.5, the time required to 
obtain s will be six time units, where one time unit is the time it takes to get one output from the machine. In general, the time required to 
obtain s will be proportional to the number of bits Theoretically, this time can be decreased, but the explanation would require a long digres- 
sion and our relative results would not change that much. We wiU use the rule that the number of time units needed to perform addition modulo 
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n is proportional to riog2 «1 • 

Now we will introduce a hypothetical problem that we will use to illustrate the idea of a fast adder. Suppose that we had to add many numbers 
modulo 27720 = 8- 9 • 5 • 7- 11. By the rule above, since 2^^ < 27720 < 2", each addition would take 15 time units. If the sum is 
initialized to zero, 1,000 additions would be needed; thus, 15,000 time units would be needed to do the additions. We can improve this time 
dramatically by applying the Chinese Remainder Theorem. 

The Chinese Remainder Theorem (CRT). Let Uj, rip be integers that have no common factor greater than one between any pair 

of them; i. e., they are relatively prime. Let n = «y«2 ■■■ n,,. Define 

ff :Z„^ Z„, xZ„, X ■■■xZn^ 

by 

m ={k„k2 kp) 

where for 1 < i < p, < k/ < «/ and k = ki(mod ni). Then is an isomorphism from Z'„ into Z'„, xZ'„, x ■■■ xZ„^ . 
This theorem can be stated in several different forms, and its proof can be found in many abstract algebra texts. 

Example 15.1.8. As we saw in Chapter 11, is isomorphic to Z2 x Z3 . This is the smallest case to which the CRT can be applied. An 
isomorphism between Zj and Z2 x Z3 is 



(0) 


= (0, 0) 


6(3) 


= (1,0) 


(1) 


= (1, 1) 


6(4) 


= (0, 1) 


(2) 


= (0, 2) 


B(5) 


= (1,2) 



Let's consider a somewhat larger case. We start by selecting a modulus that can be factored into a produce to relatively prime integers. 

n = 2= 3^ 5= 

21600 



In this case the factors are 2' = 32, 3'' = 27, and 5^ = 25. They need not be powers of primes, but it is easy to break the factors into this form to 
assure relatively prime numbers. To add in Z„, we need riog7 n] = 15 time units. Let G = Z32 XZ27 XZ25. The CRT gives us an isomorphism 
between Z21600 and G. The basic idea behind the fast adder, illustrated in Figure 15.1.3, is to make use of this isomorphism. 




, "1 +tTl 




OCIV s\ ^ 


"J -^tij 




Jie» Ji ^ 












IKV Si ^ 






Dew 35 X 




FieiMt 1S.U 

Assume we have several integers a 1 , . . 



a„, to be added. Here, we assume m = 20. 



a = {1878, 1384, 84, 2021, 784, 1509, 1740, 1201, 

2363, 1774, 1865, 33, 1477, 894, 690, 520, 198, 1349, 1278, 650}; 

After each of the Si' s is initialized to zero, each summand t is decomposed into a triple 0{t) = (t\ , f2, (3) e G. For our example we first define 6 
as a Mathematica function and then map it over the list of summands. 

: = {Mod [n, 32] , Mod[n, 27] , Mod [n, 25] } 
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dlstrlbutedSummands = Map [3, a] 



(22 


15 


3 \ 


8 


7 


9 


20 


3 


9 


5 


23 


21 


16 


1 


9 


5 


24 


9 


12 


12 


15 


17 


13 


1 


27 


14 


13 


14 


19 


24 


9 


2 


15 


1 


6 


8 


5 


19 


2 


30 


3 


19 


18 


15 


15 


8 


7 


20 


6 


9 


23 


5 


26 


24 


30 


9 


3 


, 10 


2 


, 



Addition in C can be done in parallel so that each new subtotal in the form of the triple (si , S2, s 3 ) takes only as long to compute as it takes to 
add in the largest modulus, logj 32 = 5 time units, if calculations are done in parallel. By the time rule that we have established, the addition of 
20 numbers can be done in 20 x 5 = 100 time units, as opposed to 15 x20 = 300 time units if we do the calculations in Z„. 
The result of adding the distributed summands in the three different moduli for our example would be the following. 

dlstrlbutedSum = 

Fold[{Mod[ttl[ [1]] +tt2[[l]], 32], Mod[ttl[[2]] +tt2[[2]], 27], Mod[ttl[[3]] +tt2[[3]], 25]} S, 
{0, 0, 0}, dlstrlbutedSummands] 

{12, 13, 17} 

Two more factors must still be considered, however. How easy is it to determine 6(a) and O'^isi, S2, ^3)? We must compute 6(a) twenty times, 
and, if it requires a sizable amount of time, there may not be any advantage to the fast adder. The computation of an inverse is not as time- 
critical since it must be done only once, after the final sums are determined in G. 

The determination of 6(a) is not a major problem. If the values of 6(1), 6(10), 0(100), 0(1000), and 0(10000) are stored, 
a = do + lOdi + 100^2 + lOOOds + 10000^4, then 

61(a) = do 0(1) + di 0(10) + ^2 6»(100) + d^ 0(1000) + ^4 0(10000) 

by the fact that is an isomorphism. The components of 6(a) can be computed economically using this formula so as not to slow down the 
actual adding process. 

The computation of 6~'^(si, S2, s^) is simplified by the fact that 0~* is also an isomorphism. The final sum is 

•Vi 0"'(1 . 0, 0) + .S2 '(0. 1 , 0) + .V3 0"'(O, 0, 1). The arithmetic in this expression is in Z21 600 and is more time consuming. However, as was 
noted above, it need only be done once. This is why the fast adder is only practical in situations where many additions must be performed to get 
a single sum. 

For our example, we can use Mathematica's built-in function for inverting 0: 
ChineseRemalnder [dlstrlbutedSum, {32, 27, 25}] 

2092 

The result we get is exactly what we get by directly adding in the larger modulus. 
Fold[Mod[ttl + n2, 32 x27 x 25] &, 0, a] 

2092 

Notice that if we wanted the conventional sum of integers our list, the result we just arrived at would not be correct. The relationship between 
the integer sum and the modular sum is that they differ by a multiple of the modulus: 
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Total [a] 

23692 

Mod [Total [a] - Fold [Mod [ttl + tt2, 32 x 27 x 25] &, 0, a] , 32 x 27 x 25] 



To further illustrate the potential of fast adders, consider the problem of addition modulo 

n = 2^3352 72 1M3-17-19-23-29-31-37-41-43-47 -3.1x10^1 

Each addition using the usual modulo n addition with fuU adders would take 72 time units. By decomposing each summand into 15-tuples 
according to the CRT, the time is reduced to riog2 491 = 6 time units per addition. 

EXERCISES FOR SECTION 15.1 
A Exercises 

1. What generators besides 1 does [Z, +] have? 

2 . Without doing any multipUcations , determine the number of generators of [Z n , + n ] . 

3. Prove that if |G| > 2 and G is cyclic, G has at least two generators. 

4. If you wanted to list the generators of Z„ you would only have to test the first n/2 positive integers. Why? 

5 . Which of the following groups are cyclic? Explain. 

(a) [Q, +] 

(b) [R\ •] 

(c) [6Z, +]where6Z = {6n |n e Z) 

(d) Z X Z 

(e) Z2XZ3 XZ25 

6. For each group and element, determine the order of the cyclic subgroup generated by the element: 

(a) Z25,15 

(b) Z4XZ9 , (2, 6) (apply Exercise 8) 

(c) Z64,2 

B Exercises 

7. How can Theorem 15.1 .4 be apphed to hst the generators of Z„? What are the generators of Z25? Of Z256? 

8 . Prove that if the greatest common divisor of n and m is 1 , then ( 1 , 1 ) is a generator of Z„xZ^, and , hence , Z„ x Z„ is isomorphic to Z„ „ . 

9 . (a) Illustrate how the fast adder can be used to add the numbers 21,5,7, and 1 5 using the isomorphism between Z77 and Z7 x Z 1 1 . 

(b) If the same isomorphism is used to add the numbers 25, 26, and 40, what would the result be, why would it be incorrect, and how 
would the answer differ from the answer in part a? 

10. Prove that if G is a cycUc group of ordern with generator a, and q e {0, 1, n - l},then 
(j> + q)a = {p+„q)a 

' 15.2 Cosets and Factor Groups 

Consider the group [Z12, +12]. As we saw in the previous section, we can picture its cycUc properties with the string art of Figure 15.1.1. Here 
we wiU be interested in the non-generators, like 3. The soUd lines in Figure 15.2.1 show that only one-third of the tacks have been reached by 
starting at zero and jumping to every third tack. The numbers of these tacks correspond to (3) = {0, 3, 6, 9). 
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Figure 15.2.1 



What happens if you start at one of the unused tacks and again jump to every third tack? The two broken paths on Figure 15.2.1 show that 
identical squares are produced. The tacks are thus partitioned into very similar subsets. The subsets of Z12 that they correspond to are 
{0, 3, 6, 9), {1, 4, 7, 10), and {2, 5, 8, ll).These subsets are called cosets. In particular, they are called cosets of the subgroup 
{0, 3, 6, 9}. We will see that under certain conditions, cosets of a subgroup can form a group of their own. Before pursuing this example any 
further we will examine the general situation. 

Definition: Coset. lf[G, *] is a group, H < Ganda e G, the left coset of H generated by a is 

a *H = {a*h \ h e H). 
The right coset ofH generated by a is 

H*a ={h*a\ h eH]. 

Notes: 

(a) H itself is both a left and right coset since e*H = H*e = H. 

(b) If G is abeUan, a*H = H*a and the left-right distinction for cosets can be dropped. We will normally use left coset notation in that 
situation. 

Definition: Coset Representative. Any element of a coset is called a representative of that coset. 
One might wonder whether a is in any way a special representative ota*H since it seems to define the coset. It is not, as we shall see. 
Theorem 152.1. Ifb e a *H, then a *H = b *H, and ifb e H *a, then H *a = H *b. 

Remark; A Duality Principle. A duality principle can be formulated concerning cosets because left and right cosets are defined in such 
similar ways. Any theorem about left and right cosets will yield a second theorem when "left" and "right" are exchanged for "right" and "left." 

Proof of Theorem 15.2.1: In light of the remark above, we need only prove the first part of this theorem. Suppose that x e a*H. We need only 
find a way of expressing x as "fo times an element of H." Then we will have proven thata*H c b*H . By the definition of a*H, since b and x 
are in a there exist /zi and /!2 in such that 6 = a*hi itadx = a */!2. Given these two equations, a = bhr^ and 

X = a*h2 = (i )*/l2 = */!2 ). 

Since /!i,/!2 e/f, hi~^*h2 e and we are done with this part of the proof. In order to show that C a *H, one can follow essentially the 
same steps, which we will let the reader fill in. ■ 

Example 15.2.1. In Figure 15.2.1, you can start at either 1 or 7 and obtain the same path by taking jumps of three tacks in each step. 
Thus, 

1 +12 jO, 3, 6, 9) = 7 +12 |0, 3, 6, 9) = |1, 4, 7, 10). 

The set of left (or right) cosets of a subgroup partition a group in a special way: 

Theorem 15.22. If [G, *j is a group and H < G, the set of left cosets ofH is a partition of G. In addition, all of the left cosets ofH have 
the same cardinality. The same is true for right cosets. 

Proof: That every element of G belongs to a left coset is clear because a e a *// for all a e G.Jfa*H and b*H aie left cosets, they are either 
equal or disjoint. If two left cosets a*H and b*H are not disjoint, a*H f] b*H is nonempty and some element c belongs to the intersection. 
Then by Theorem 15.2.1, 
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cea*H => a*H = c*//and 
cGb*H ^ b*H = c*H. 
Hence a*H = b*H. 

We complete the proof by showing that each left coset has the same cardinality as H. To do this, we simply observe that if a e G, 
p.H a *// defined by p(/i) = a*/j is abijection and hence |^/| = |a *//|. We will leave the proof of this statement to the reader. ■ 

The function p has a nice interpretation in terms of our opening example. If a e Z„, the graph of (0, 3, 6, 9| is rotated 30 a° to coincide with 
one of the three cosets of (0, 3, 6, 9). 

A Counting Formula. If \G\ < oo and H < G, the number of distinct left cosets of H equals — . For this reason we use G/H to denote 

1^1 

the set of left cosets of H in G. 

Example 15.2.2. The set of integer multiples of four, 4Z, is a subgroup of [Z, +]. Four distinct cosets of 4Z partition the integers. 
They are 4 Z, 1 + 4 Z, 2 + 4 Z, and 3 + 4 Z, where, for example, l+4Z = (l+4fc|A:eZ). 4Z can also be written + 4 Z. 

Distinguished Representatives. Although we have seen that any representative can describe a coset, it is often convenient to select a distin- 
guished representative from each coset. The advantage to doing this is that there is a unique name for each coset in terms of its distinguished 
representative. In numeric examples such as the one above, the distinguished representative is usually the smallest nonnegative representative. 
Remember, this is purely a convenience and there is absolutely nothing wrong in writing -203 + 4 Z, 5 + 4 Z, or 621 + 4 Z in place of 1 + 4 Z 
because -203, 5, 621 e 1 + 4Z. 

Before completing the main thrust of this section, we will make note of a significant implication of Theorem 15.2.2. Since a finite group is 
divided into cosets of a common size by any subgroup, we can conclude: 

Lagrange's Theorem. The order of a subgroup of a finite group must divide the order of the group. 
One immediate implication of Lagrange's Theorem is that if p is prime, Zp has no proper subgroups. 

We will now describe the operation on cosets which will, under certain circumstances, result in a group. For most of this section, we will 
assume that G is an abelian group. This is one condition that guarantees that the set of left cosets will form a group. 

Definition: Operation on Cosets. Let C and D be left cosets of H, a subgroup of G with representatives c and d, respectively. Then 
C0D = c*H ®d*H = (c*d)*H 
The operation (g) is called the operation induced on left cosets by * . 

In Theorem 15.2.3, later in this section, we prove that if G is an abelian group, (gi is indeed an operation. In practice, if the group G is an 
additive group, the symbol ® is replaced by +, as in the following example. 

Example 15.2.3. Consider the cosets described in Example 15.2.2. For brevity, we rename + 4Z, 1 + 4Z,2 + 4Z, and 3 + 4Z with 
the symbols 0, 1,2, and 3. Let's do a typical calculation, 1+3. We will see that the result is always going to be , no matter what represen- 
tatives we select. For example, 9 e 1 , 7 e 3, and 9 + 7 = 16 e 0. Our choice of the representatives 1 and 3 were completely arbitrary. If 
you are reading this as a Mathematica Notebook, you can try out this demonstration that lets you select representatives of the two cosets by 
moving the sliders and see how the result is consistent. 







Your selection of a representative of 1 :9 


Good Choice! 


Your selection of a representative of 3 : 7 


Good Choice! 


The sum of representatives is 16 


The sum is in 



Since C i8> £) (or 1 + 3 in this case) can be computed in many ways, it is necessary to show that the choice of representatives does not affect 
the result. When the result we get ior C ® D is always independent of our choice of representatives, we say that "® is well defined." 
Addition of cosets is a well-defined operation on the left cosets of 4Z and is summarized in Table 15 .2.1 . Do you notice anything familiar? 
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TABIE 15.2.1 
Cowl Op»Tatlon— Table Of fxan>pt« 1S^.3 
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Example 15.2.4. Consider the real numbers. [R; +J, and its subgroup of integers, Z. Every element of K/Z has the same cardinality as 
Z. Let s, / e IR. i e J + Z if i can be written J + n for some n e Z. Hence i and / belong to the same coset if they differ by an integer. (See 
Exercise 6 for a generalization of this fact.) 

Now consider the coset 0.25 + Z. Real numbers that differ by an integer from 0.25 are 1.25, 2.25, 3.25, ... and -0.75, -1.75,-2.75... . If any 
real number is selected, there exists a representative of its coset that is greater than or equal to and less than 1. We will call that representa- 
tive the distinguished representative of the coset. For example, 43.125 belongs to the coset represented by 0.125; -6.382 + Z has 0.618 as its 
distinguished representative. The operation on R/Z is commonly called addition modulo 1 . A few typical calculations in IR/Z are 

(0.1 +Z) + (0.48 + Z) = 0.58 + Z , 

(0.7 + Z) + (0.31 +Z) = 0.01 + Z, 

and -(0.41 -I- Z) = -0.41 -I- Z = 0.59 -I- Z. 

In general, -(a + Z) = {I - a) + Z.. 

Example 15.2.5. Consider F = (Z4X Z2)///, where H = |(0, 0), (0, 1)|. Since Z4X Z2 is of order 8, each element of F is a coset 
containing two ordered pairs. We will leave it to the reader to verify that the four distinct cosets are 

(0, 0) + H, (1,0) +H,(2, 0) -I-//, and (3, 0) + H. 
The reader can also verify that F is isomorphic to Z4 , since F is cyclic. An educated guess should give you a generator. 

Example 15.2.6. Consider the group Z2'* = Z2XZ2XZ2XZ2 . Let H be ((1 , 0, 1, 0)), the cyclic subgroup of Z2'' generate by (1,0,1,0). 
Since 

(1,0, 1, 0)-l-(l,0, 1, 0) = (1 +2 1,0-1-2 0, 1 4-2 1,0-1-2 0) = (0,0,0,0) 

The order of // is 2 and , Z2''/ H has \Z\ /H\ = = y = 8 elements. A typical coset is 

C = (0, 1, 1, l)+H = 1(0, 1, 1, 1), (1, 1, 0, 1)). 

Since 2(0, 1, 1, 1) = (0, 0, 0, 0), 2C = //, the identity for the operation Z2'*///. The orders of all non-identity elements of Z2*/// are 
all 2, and it can be shown that the factor group is isomorphic to Z2^ . 

Theorem 15.23. If G is an abelian group, and H < G, the operation induced on cosets ofH by the operation of G is well defined. 

Proof: Suppose that a, b, and a', b' are two choices for representatives of cosets C and D. That is to say that a, a' e C,b, b' eD. We 
will show that a*b and a'-*b' are representatives of the same coset . Theorem 1 5 .2 . 1 implies that C = a *H and D = b*H, thus we have 

a' e a*Handb' e b*H. 

Then there exists ft 1 , h2 e // such that a' = a-thi and/)' = b*h2 and so 

a't-b' = (a*h\)*(b*h2) 
= (a*b)*(h\*h2) 

by various group properties and the assumption that H is abelian, which lets us reverse the order in which b and /jiappear. This last expres- 
sion for a ' * ^' implies that a' *b' e (a*b)*H since /i 1 * /i 2 e H because H is as subgroup of G. ■ 

Theorem 152.4. Let G be a group and H < G. If the operation induced on left cosets ofH by the operation of G is well defined, then the 
set of left cosets forms a group under that operation. 

Proof: Let Ci, C2, and C3 be the left cosets with representatives r^, ^2, and ^3, respectively. The values of C] (C2 <S) C3) and 
(Ci ® C2 ) » C3 are determined by ri * (^2 * ^3) and (ri * ^2 ) * ^3 .By the associativity of * in G, these two group elements are 
equal and so the two coset expressions must be equal. Therefore, the induced operation is associative. As for the identity and inverse 
properties, there is no surprise. The identity coset is // , or e*H, the coset that contains G's identity. If C is a coset with representative a, that 
is,if, C = a*//, then C"' is a"' *//. 

(a*H)iS> (a^^ *II) = (a*a^^)*H = e*H = identity coset. 

Definition: Factor Group. Let G be a group and H < G. If the set of left cosets of H forms a group, then that group is called the factor 
group of G modulo H. It is denoted G I H. 

Note: If G is abelian, then every subgroup of G yields a factor group. We will delay further consideration of the non-abelian case to Section 
15.4. 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial- Share A like 3.0 United States License. 



Chapter 15 - Group Theory and Applications 



369 



Remark on Notation: It is customary to use the same symbol for the operation of G/H as for the operation on G. The reason we used distinct 
symbols in this section was to make the distinction clear between the two operations. 



EXERCISES FOR SECTION 15.2 
A Exercises 

1. Consider Zio and the subsets of Zio , {0, 1, 2, 3, 4} and {5, 6, 7, 8, 9}. Why is the operation induced on these subsets by modulo 10 
addition not well defined? 

2. Can you think of a group G, with a subgroup H such that \H\ = 6 and \G/H\ = 6? Is your answer unique? 

3. For each group and subgroup, what is G/H isomorphic to? 

(a) G = Z4 X Z2 and // = ((2, 0)>. Compare to Example 15.2.5. 

(b) G = [C, +] rnidH = R. 

(c) G = Z20 and H = <8> . 

4. For each group and subgroup, what is G/H isomorphic to? 

(a) G = ZxZ md H = {{a, a) | a e Z}. 

(b) G = [R', •] and H = {1, -1}. 

(c) G =Z2' and = <(1, 1, 1, 1, 1)>. 

B Exercises 

5. Prove that if G is a group, // < Ganda, b e G, a*H = h*H if and only if h^'ta eH. 

6. (a) Real addition modulo r, r > 0, can be described as the operation induced on cosets of (r) by ordinary addition. Describe a system of 
distinguished representatives for the elements of R/<r>. 

(b) Consider the trigonometric function sine. Given that sin (x + lnk) = sin x for all x e R and A: e Z, show how the distinguished representa- 
tives of R/(2;r) can be useful in developing an algorithm for calculating the sine of a number. 

1 15.3 Permutation Groups 

At the risk of boggling the reader's mind, we will now examine groups whose elements are functions. Recall that a permutation on a set A is a 
bijection from A into A. Suppose that A = {1, 2, 3). There are 3 ! =6 different permutations on A. We will call the set of all 6 permutations 
S3. They are listed in Table 15.3.1. The matrix form for describing a function on a finite set is to list the domain across the top row and the 
image ofeach element directly below it. For example ri(l) = 2. 



'-{1 2 3) ^'-(1 3 2) 

'' = (2 3 1 ) = ( 3 2 1 ) 

/ 1 2 3 \ / 1 2 3 \ 

'' = [3 1 2} ^'=[2 1 3 j 

Table 153.1 

Elements of S3 



The operation that will give ^1,^2, fi , /2> /3I a group structure is function composition. Consider the "product" ri "fy. 

n^/sd) =r,(/3(l))= ri(2) = 3 
'■i°./3(2) = r,(,/3(2))= r,(l)= 2 
'•i°./3(3) = r,(,/3(3))= r,(3)= 1 

The images of 1, 2, and 3 under rio/3 and /2are identical. Thus, by the definition of equality for functions, we can say riof^ = f2 . The 
complete table for the operation of function composition is given in Table 15.3.2. We don't even need the table to verify that we have a group: 

(a) Function composition is always associative (see Chapter 7). 

(b) The identity for the group is / . If g is any one of the permutations on A and x e A, 

g°i(x) = g(i(x)) = g(x) 

and 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-ShareAlike 3.0 United States License. 



370 



Chapter 15 - Group Theory and Applications 



!og(x) = i{g{x)) = 
Therefore go; = !og = g. 

(c) A permutation, by definition, is a bijection. In Chapter 7 we proved that this implies that it must have an inverse and the inverse itself is a 
bijection and hence a permutation. Hence all elements of 53 have an inverse in 53. If a permutation is displayed in matrix form, its inverse can 
be obtained by exchanging the two rows and rearranging the columns so that the top row is in order. The first step is actually sufficient to obtain 
the inverse, but the sorting of the top row makes it easier to recognize the inverse. 

Example 15.3.1. Lets consider a typical permutation on |1, 2, 3, 4, 5), 



f- 



1 2 3 4 5 
5 3 2 1 4 



/-' = 



53214\_/'12345 
12345J~l4325 1 



Note from Table 15.3.2 that this group is non-abelian. Remember, non-abelian is the negation of abelian. The existence of two elements that 
don't commute is sufficient to make a group non-abelian. In this group, ri and is one such pair: ri°fj = /2 while f3°ri = fi, so 
ri °h =f= f3°i'i. Caution: Don't take this to mean that every pair of elements has to have this property. There are several pairs of elements in S3 
that do commute. In fact, the identity, must commute with everything. Also every element must commute with its inverse. 
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Table 15.3.2 

Operation Table for 53 



Definition: Symmetric Group. Let Abe a nonempty set. The set of all permutations on A with the operation of function composition is 
called the symmetric group on A, denoted Sa- 

Our main interest will be in the case where A is finite. The size of A is more significant than the elements, and we will denote by 5^. the 
symmetric group on any set of cardinality k,k> 1 . 

Example 15.3.2. Our opening example, S3, is the smallest non-abelian group. For that reason, all of its proper subgroups are abelian: in 
fact, they are all cyclic. Figure 15.3.1 shows the Hasse diagram for the subgroups of S3. 




Figure 15.3.1 
Lattice diagram of subgroups of S3 

Example 15.3.3. The only abelian symmetric groups are Si and S2 , with 1 and 2 elements, respectively. The elements of S2 are 

'■ = (1 2) ""'^'^=(2 ^) 

S2 is isomorphic to Z2. 

Theorem 153.1. For k > 1, |S(.| =k'. and for k > 3, S/, is non-abelian. 

Proof The first part of the theorem follows from the extended rule of products (see Chapter 2). We leave the details of proof of the second part 
to the reader after the following hint. Consider/ in S^- where /(I) = 2,/ (2) = 3,/ (3) = l,and/(7) = 7 for 3 < j < n. Now define g in a 
similar manner so that when you compare /(g(l)) and g(f(l)) you get different results. ■ 
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Cycle Notation 

A second way of describing a pemutation is by means of cycles, which we wiU introduce first with an example. Consider / e Sg : 

n 2 3 4 5 6 7 8 \ 
•'"1 827654 isj 

Consider the images of 1 when /is applied repeatedly. The images /(l), /(/(!)), f(f(f(l))), ... are 8, 3, 7, 1, 8, 3, 7, .... If j > 1, In Figure 
15.3.2(a), this situation is represented by the component of the graph that consists of 1, 8, 3, and 7 and shows that the values that you get by 
repeatedly applying/ cycle through those values. This is why we refer to this part of/ as a cycle of length 4. Of course starting at 8, 3, or 7 
also produces the same cycle with only the starting valued changing. 



8 




7 



(a) (b) 

Figure 15.3.2 
Representations of cycies of iength 4. 

Figure 15.3.2(a) illustrates how the cycle can be represented in a visual manner, but it is a bit awkward to write.. Part (b) of the figure presents a 
more universally recognized way to write a cycle. In (b), a cycle is represented by a list where the image of any number in the list is its 
successor. In addition, the last number in the list has as its image the first number. 

The other elements of the domain of/, are never reached if you start in the cycle (1, 8, 3, 7), and so looking at image of these other numbers 
will produce numbers that are disjoint from the set {1, 8, 3, 7}. The other disjoint cycles off are (2), (4, 6), and (5). We can express/as a 
product of disjoint cycles: 

/= (1, 8, 3, 7) (2) (4, 6) (5) 

or 

/= (1,8, 3, 7) (4, 6) 
where the absence of 2 and 5 implies that / (2) = 2 and / (5) = 5. 

Disjoint Cycles. We say that two cycles are disjoint if no number appears in both cycles, as is the case in our expressions for / above. Disjoint 
cycles can be written in any order. Thus, we could also say that 

/= (4, 6) (1,8, 3, 7). 

Composing Permutations. We will now consider the composition of permutations written in cyclic form, again by an example. Suppose that 
/ = (1,8, 3, 7)(4, 6) and g = (1, 5, 6)(8, 3, 7, 4) are elements of Sg. To calculate /og, we start with simple concatenation: 

fog = (1,8, 3, 7) (4, 6)(1,5,6)(8, 3, 7,4). (P) 

Although this is a valid expression for f °g , our goal is to express the composition as a product of disjoint cycles as /and g were individually 
written. We will start by determining the cycle that contains 1. When combining any number of cycles, they are always read from right to left. 
The first cycle in (P) does not contain 1; thus we move on to the second. The image of 1 under that cycle is 5. Now we move on to the next 
cycle, looking for 5, which doesn't appear. The fourth cycle does not contain a 5 either; so f°g{X) = 5. At this point, we would have written 

f°g = (1,5 

on paper. We repeat the steps to determine /og(5). This time the second cycle of (P) moves 5 to 6 and then the third cycle moves 6 to 4. 
Therefore, f°g(5) = 4. We continue until the cycle (1, 5, 4, 3) is completed by determining that f°g(3) = 1. The process is then repeated 
starting with any number that does not appear in the cycle(s) that have already obtained. The final result for our example is 

fog = (1, 5, 4, 3) (6, 8, 7). 

Since /(2) = 2 andg(2) = 2,/og(2) = 2 and we need not include the one-cycle (2). 
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Video: For a video that illustrates this process, go to http://faculty.uml.edu/klevasseur/ads2/videos/cyclecompositioii/. 
Example 153^. 

(a) (1, 2, 3, 4)(1, 2, 3, 4) = (1, 3) (2, 4). 

(b) (1, 4)(1, 3)(1, 2) = (1, 2, 3, 4). 

Note that the cychc notation does not indicate the set which is being permuted. The examples above could be in 55, where the image of 5 is 5. 
This ambiguity is usually overcome by making the context clear at the start of a discussion. 

Definition: Transposition. A transposition is a cycle of length 2, 

Example 153.5. / = (1, 4) and g = (4, 5) are transpositions in S^. f°g = (1, 4, 5) and g =/ = (1, 5, 4) are not transpositions; 
thus, the set of transpositions is not closed under composition. Since = /=/ and g^ = g°g are both equal to the identity permutation, /and 
g are their own inverses. In fact, every transposition is its own inverse. 

Theorem 153 J. Every cycle of length greater than 2 can be expressed as a product of transpositions . 

Instead of a formal proof, we will indicate how the product of transpositions can be obtained. The key fact needed is that if (ci , 02, 03 a^) 

is a cycle of length k, it is equal to the following product ofk- \ cycles. 

Example 1 1 .3.4 (b) illustrates this fact. Of course, a product of cycles can be written as a product of transpositions just as easily by applying the 
rule above to each cycle. For example, 

(1, 3, 5, 7)(2, 4, 6) = (1, 7)(1, 5)(1, 3)(2, 6)(2, 4). 

Unlike the situation with disjoint cycles, we are not free to change the order of these transpositions. 

The proofs of the following two theorems appear in many abstract algebra texts . 

Theorem 1533. Every permutation on a finite set can he expressed as the product of an even number of transpositions or an odd 
number of transpositions , but not both. 

Theorem 15.3.3 suggests that 5„ can be partitioned into its "even" and "odd" elements. 

Example 153.6. The even permutations of S3 are i ,ri = and r2 = ■ They form a subgroup, {;, ri , ^2} of S3 . 

In general: 

Theorem 153.4. Let n >2. The set of even permutations in S„ is a proper subgroup of called the alternating group on [1, 2, . . .n}, 
denoted A„. The order ofA„ is y . 

Proof: In this proof, the letters s and t stand for transpositions and p, q are even noimegative integers. 

If f, g & A„, we can write the two permutations as products of even numbers of transpositions: 

f'>g=SiS2---Sptit2---tg 

Since p + q is even, fog e A„. Since A„ is closed With respect to function composition, we have proven that A„ is a subgroup of S„. by 
Theorem 11.5.2. To prove the final assertion, let B„ be the set of odd permutations and let r = (1, 2). Define 0: A„ -> B„ by 9(f) = f °t. 
Suppose that 6{f) = d(g). Then f°T = g°T and by the cancellation law, f = g. Hence, 6 is an injection. Next we show that 6 is also a surjec- 
tion. If /i e B„, h is the image of an element of A„. Specifically, h is the image of h°r. 

e(hoT) = (hoT)oT Why? 
= hoiTOT) Why? 
= hoi Why? 
= h Why? 

Since 6 is a bijection, |A„| = |B„| = y. ■ 

Example 15.3.8. Consider the sliding-tile puzzles pictured in Figure 15.3.3. Each numbered square is a tile and the dark square is a gap. 
Any tile that is adjacent to the gap can slide into the gap. In most versions of this puzzle, the tiles are locked into a frame so that they can be 
moved only in the manner described above. The object of the puzzle is to arrange the tiles as they appear in Configuration a. Configurations 
b and c are typical starting points. We propose to show why the puzzle can be solved starting with b, but not with c. 
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(a) 



(b) 

Figure 15.3.3 
Configurations of tfie tile puzzie. 



(c) 



We will associate any configuration of tlie puzzle with an element of Sig. Imagine that a tile numbered 16 fills in the gap. If/is any configura- 
tion of the puzzle, / is Configuration a, and for 1 < k < 16, 

f (k) = the number that appears in the position of ^ of !. 
If we call Configurations b and c by the names fi and /2 respectively, 

/i = (1, 5, 3, 7) (2, 6, 4, 8) (9, 10) (11, 14, 13, 12) (15) (16) 



and 



/2 = (1, 5, 3, 7, 15)(2, 6, 4, 8)(9, 10)(11, 14, 13, 12)(16). 

How can we interpret the movement of one tile as a permutation? Consider what happens when the 12 tile of i slides into the gap. The result 
is a configuration that we would interpret as (12, 16), a single transposition. Now rf we slide the 8 tile into the 12 position, the result is or 
(8, 16, 12). Hence, by "exchanging" the tiles 8 and 16, we have obtained (8, 16) (12, 16) = (8, 16, 12). 
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Figure 15.3.4 
Tlie configuration (8, 16, 12). 

Every time you slide a tile into the gap, the new permutation is a transposition composed with the old permutation. Now observe that to start 
with i and terminate after a finite number of moves with the gap in its original position, you must make an even number of moves. Thus, any 
permutation that leaves 16 fixed, such as /i or f2, cannot be solved if it is odd. Note that f2 is an odd permutation; thus. Puzzle c can't be 
solved. The proof that all even permutations, such as /i , can be solved is left to the interested reader to pursue. 

Realizations of Groups. By now we've seen several instances a group can appear through an isomorphic copy of itself in various settings. 
The simplest such example is the cyclic group of order 2. When this group is mentioned, we might naturally think of the group [Z2, +2], but 
the groups L|-l , 1 !, J and [52, °J are isomorphic to it. None of these groups are necessarily more natural or important than the others. Which 
one you use depends on the situation you are in and all are referred to as realizations of the cyclic group of order 2. The next family of groups 
we will study has two natural realizations, first as permutations and second as geometric symmetries. 

Example 153.9. Dihedral Groups. The dihedral groups can realized in several ways and we wiU concentrate on two of them. They can be 
visualized as symmetries of a regular polygon — this is probably the easiest way to understand the groups. In order to represent the groups on a 
computer, it is convenient to represent the groups as subgroups of the symmetric groups, li k> 3, the dihedral group, Dj, , is a subgroup of Sj^. 
It is the subgroup of generated by the k- 

Realization as symmetries of regular polygons. 

We can describe D„ in terms of symmetries of a regular «-gon (n = 3: equilateral triangle, n = 4: square, n = 5: a regular pentagon, . . .). Here 
we will only concentrate on the case of D4. If a square is fixed in space, there are several motions of the square that wiU, at the end of the 
motion, not change the apparent position of the square. The actual changes in position can be seen if the comers of the square are labeled. In 
Figure 15.3.5, the initial labeling scheme is shown, along with the four axes of symmetry of the square. 
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Figure 15.3.5 
Axes of symmetry of the square. 

It might be worthwhile making a square like this with a sheet of paper. Be careful to label the back so that the numbers match up. Two motions 
of the square will be considered equivalent if the square is in the same position after performing either motion. There are eight distinct motions. 
The first four are °, 90 °, 180 °, and 270 ° clockwise rotations of the square, and the other four are the 180° flips along the axes Ij, and I4.. 
We 

will call the rotations /, ri, r2, and ^3, respectively, and the flips /i, /2, f^, and f-t, respectively. Figure 15.3.6 illustrates and /]. For future 
reference we also include the permutations to which they will correspond. 




n 2 3 4^ 
"b 3 4 ij 




12 3 4 
4 3 2 1 



Figure 15.3.6 
Two elements of D^ 



What is the operation on this set of symmetries? We will call the operation "followed by" and use the symbol * to represent it. The operation 
will be combine motions, apply motions from right to left, as with functions. We will illustrate how * is computed by finding ri*f]. Starting 
with the initial configuration, if you perform the fi motion, and then immediately perform ri on the result, we get the same configuration as if 
we just performed /4, which is to flip the square along the line I4. Therefore, ri */i = /a . 

Realization as permutations. 

We can also realize the dihedral groups as permutations. For any symmetric motion of the square we can associate with it a permutation. In 
the case of D4, the images of each of the numbers 1 through 4 are the positions on the square that each of the corners 1 through 4 are moved to. 
For example, since corner 4 moves to position 1 when you perform ri , the corresponding function will map 4 to 1 . In addition, 1 gets mapped 
to 2, 2 to 3 and 3 to 4. Therefore, ri is the cycle (1,2,3,4). The flip /i transposed two pairs of corners and corresponds to (1 , 4) (2, 3). If we 
want to combine these two permutations, using the same names as with motions, we get 
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rio/i= (1,2, 3,4)0(1,4) (2, 3) = (1) (2, 4) (3) = (2,4). 
Notice that this permutation is corresponds withe flip /4. 

Although D4 isn't cyclic (since it isn't abelian), it can be generated from the two elements ri and /i: 

D4 = {r\, f\) = (/, r,, ri^, ^,^ /,, r\°fu r,^ of, , r,^ of,] 
It is quite easy to describe any of the dihedral groups in a similar fashion. Let 

r= (1,2, n), ann-cycle, and 

/= (l,«)(2,n-l)... 
ThenD„= (r, f) = {i,r,r^, ... , r"-\ f , ro f , r^oj\ r"-'o/) 

An application of D4. One appUcation of D4 is in the design of a letter-facing machine. Imagine letters entering a conveyor belt to be 
postmarked. They are placed on the conveyor belt at random so that two sides are parallel to the belt. Suppose that a postmarker can recognize a 
stamp in the top right comer of the envelope, on the side facing up. In Figure 15.3.7, a sequence of machines is shown that wiU recognize a 
stamp on any letter, no matter what position in which the letter starts. The letter P stands for a postmarker. The letters R and F stand for rotating 
and flipping machines that perform the motions of r, and /i . 




Deliver 



Letters PRPRPRPFPRPRPRP —►Reject 

Figure 15.3.7 
A letter facer 

The arrows pointing up indicate that if a letter is postmarked, it is taken off the conveyor belt for deUvery. If a letter reaches the end, it must not 
have a stamp. Letter-facing machines like this have been designed (see GaUian's paper). One economic consideration is that R-machines tend to 
cost more than F-machines. R-machines also tend to damage more letters. Taking these facts into consideration, the reader is invited to design a 
better letter-facing machine. Assume that R-machines cost $800 and F-machines cost $500. Be sure that aU comers of incoming letters wiU be 
examined as they go down the conveyor belt. 

EXERCISES FOR SECTION 15.3 
A Exercises 

1 . Given 

/1234\ ( I 2 3 4\ /'1234\ 
■^ = (2 1 4 3j'^ = i2 3 4 ir"" = i3 2 4 1 J' 
compute 

(a) f°8 

(b) g°h 

(c) (J°g)°h 

(d) fo(goh) 

(e) /i-i 

(f) h-^goh 

(g) r' 

2. Write/, g, and h from Exercise 1 as products of disjoint cycles and determine whether each is odd or even. 

3. Do the left cosets of A3 = {;, ri, over 53 form a group under the induced operation on left cosets of A3? What about the left cosets of 
</i>? 

4. In its reaUzation as permutations, the dihedral group D3 is equal to ^3. Can you give a geometric explanation why? Why isn't D4 equal to 

54? 

B Exercises 

5. (a) Complete the Ust of elements of D4 and write out a table for the group in its reaUzation as symmetries. 
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(b) List the subgroups of D4 in a lattice diagram. Are they all cyclic? To what simpler groups are the subgroups of D4 isomorphic? 

6. Design a better letter-facing machine (see Example 15.3.9). How can you verify that a letter-facing machine does indeed check every comer 
of a letter? Can it be done on paper without actually sending letters through it? 

7. Prove by induction that if r > 1 and each f;, is a transposition, then 

(fjofjo.-.of^)-! = t^o...ct20t^ 

8. How many elements are there in D5 ? Describe them geometrically. 

9 . Complete the proof of Theorem 15.3.1. 

10. How many left cosets does A„, n > 2 have? 

11. Prove that in D„, /or = r""^ 0/ 

C Exercise 

12. (a) Prove that the tile puzzles corresponding to A 16 n 1/ e -Sie | /(16) = 16) are solvable, 
(b) If / (16) 16, how can you determine whether f's puzzle is solvable? 

13. (a) Prove that 53 is isomorphic to if 3, the group of 3 x3 rook matrices (see Section 11.2 exercises), 
(b) Prove that for each n > 2, i?„ is isomorphic to S„. 
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I 15.4 Normal Subgroups and Group Homomorphisms 



Our goal in this section is to answer an open question and introduce a related concept. The question is: When are left cosets of a subgroup a 
group under the induced operation? This question is open for non-abelian groups. Now that we have some examples to work with, we can try a 
few experiments . 

NORMAL SUBGROUPS 

Example 15.4.1 A3 = {/, ri, is a subgroup of ^3, and its left cosets are A3 itself and B3 = l/i, /2, /3I ■ Whether IA3 , B3 1 is a group 
boils down to determining whether the induced operation is well defined. Consider the operation table for S^m Figure 15.4.1. 
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fl 


fl 
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f-i 
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/l 


fl 


r\ 




I 



Figure 15.4.1 
Shaded operation table for S3 

We have shaded in all occurrences of the elements of B3 in gray. We will call these elements the gray elements and the elements of A3 the 
white ones. 

Now consider the process of computing the coset product A30B3. The "product" is obtained by selecting one white element and one gray 
element. Note that white "times" gray is always gray. Thus, A30B3 is well defined. Similarly, the other three possible products are well 
defined. The table for the factor group 53 /A3 is 






A3 


B3 


A3 


A3 


S3 


S3 


S3 


A3 



Clearly , 53 / A3 is isomorphic toe Z2 . Note that A3 and S3 are also the right cosets of A3 . This is significant . 

Example 15.4.2. Now let's try the left cosets of (/i) in S3. There are three of them. Will we get a complicated version of Z3 ? The left 
cosets are 

Co = </i) , C, = </,) = /3), and C2 = (/,) = (rj, /2) 

The reader might be expecting something to go wrong eventually, and here it is. To determine Ci°C2 we can choose from four pairs of 
representatives: 

r\ e Ci , ^2 e C2 — > ri or2 = / e Co 

rieCi,/2eC2 -^r,of2=feCo 

/3eCi,r2eC2 f3°r2 = fi e C2 

/3eCi,/2eC2 ^ /3°/2 = '-2eC2 

This time, we don't get the same coset for each pair of representatives. Therefore, the induced operation is not well defined and no factor 
group is obtained. 

Commentary: This last development changes our course of action. If we had gotten a factor group from [Co, Ci, C2), we might have hoped to 
prove that every collection of left cosets forms a group. Now our question is: How can we determine whether we will get a factor group? Of 
course, this question is equivalent to: When is the induced operation well defined? There was only one step in the proof of Theorem 15.2.3, 
where we used the fact that G was abelian. We repeat the equations here: 

a'*b' = (a*fti)*(b*/i2 ) = (a*b)*(fti */22), 
since G was abelian. 

The last step was made possible by the fact that hi*h = h *h]. As the proof continued, we used the fact that h\*h2 was in H and so a' *b' is 
(a*b)*h for some h in H. All that we really needed in the "abelian step" was that 

hi *b = ^ * (something in //) = b*hj, . 

Then, since H is closed under G's operation, hi,*h2 is an element of H. The consequence of this observation is included in the following 
theorem, the proof of which can be found in any abstract algebra text. 

Theorem 15.4.1. If H < G, then the operation induced on left cosets of H by the operation of G is well def ined if and only if any one of 
the following conditions is true: 
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(a) Ifh £ H,a e G, then there exists h' e H such that h*a = a*h\ 

(b) Ifh eH,a eG,thena~' *h*a e H. 

(c) Every left coset ofH is equal to a right coset ofH. 

Corollary 15.4 J. IfH < G, then the operation induced on left cosets ofH by the operation ofG is well defined if either of the follow- 
ing two conditions is true. 

(a) Gis abelian. 

(b) \H\ = ^f. 

Example 15.4.3. The right cosets of (/i> < ^3 are {;', /i), {ri/2}, and {r2 , /a). These are not the same as the left cosets of </i>. In 

addition, /z"' ./i = fi ft fi = fi <t{fi) ■ 

Definition: Normal Subgroup. If G is a group, H < G, then H is called a normal subgroup of G, denoted H < G, if it satisfies any of 
the conditions of Theorem 15.4.1. 

Example 15AA. The improper subgroups (e) and G of any group G are normal subgroups. G/(e) is isomorphic to G. All other normal 
subgroups of a group, if they exist are called proper normal subgroups. 

Example 15.4.5. By Condition b of Corollary 15.4.2, A„ is a normal subgroup of 5„ and S„/A„ is isomorphic to Z2. 

Example 15.4.6. A5, a group in its own right with 60 elements, has many proper subgroups, but none are normal. Although this could be 
done by brute force, the number of elements in the group would make the process tedious. A far more elegant way to approach the verifica- 
tion of this statement is to use the following fact about the cycle structure of permutations. If / e 5„ is a permutation with a certain cycle 
structure, (Ti (T2 ■■■0"^., where the length of o"; is then for any g e S„, g"' °f°g , which is the conjugate off by g, will have a cycle 
structure with exactly the same cycle lengths. For example if we take / = (1, 2, 3, 4) (5, 6) (7, 8, 9) e 59 and conjugate by 
g = (1,3, 5, 7, 9), 

g-io/og = (1, 9, 7, 5, 3)o(l, 2, 3, 4)(5, 6) (7, 8, 9)o(l, 3, 5, 7, 9) 
= (1,4, 9, 2) (3, 6) (5, 8,7) 

Notice that the condition for normality of a subgroup H of G is that the conjugate of any element of H by an element of G must be remain in H. 

To verify that Aj has no proper normal subgroups, you can start by cataloging the different cycle structures that occur in A5 and how 
many elements have those structures. Then consider what happens when you conjugate these different cycle structures with elements of A5. 
An outline of the process is in the exercises. 

Example 15^.7. Let G be the set of two by two invertible matrices of real numbers. That is, 

G = |^^ ^ jl a,fo,c,rfelR, arf-6c+ 0} 
We saw in Chapter 1 1 that G is a group with matrix multiplication. 

«, = {(" = °)|„.,0) 

are both subgroups of G. Hi a normal subgroup of G, while H2 is not normal. 

Homomorphisms 

Think of the word isomorphism. Chances are, one of the first images that comes to mind is an equation something like 

e(x * y) = e(x) o e(y) (H) 

An isomorphism must be a bijection, but equation (H) is the algebraic feature of an isomorphism. Here we will examine functions that satisfy 
equations of this type. 

Many homomorphisms are useful since they point out similarities between the two groups (or, on the universal level, two algebraic systems) 
involved. 

Consider the groups [R' , +J and [R', +]. Every time you use a camera, you are trying to transfer the essence of something three-dimensional 
onto a photograph— that is, something two-dimensional. If you show a friend a photo you have taken, that person can appreciate much of what 
you saw, even though a dimension is lacking. The "picture-taking" map is a function / : -» defined by fixi, X2 , x-}) = {xi, X2). This 
function is not a bijection, but it does satisfy the equation /(a: + y) = f{x) + f(y) for x = (xi, X2, X3) and y = (yi, y2, ys). Such a function is 
called a homomorphism, and when a homomorphism exists between two groups, the groups are called homomorphic that is, they are similar. A 
question that arises with groups, or other algebraic structures, that we claim are homomorphic, or similar, is: How similar are they? When we 
say that two groups are isomorphic— that is, identical— the map that we use to prove this is unimportant. However, when we say that two 
groups are homomorphic, the map used gives us a measure of the group's similarities (or dissimilarities). For example, the maps: 

/i : R' ^ R^ defined by /i(xi, X2, X3) = (_xi, X2, X3) , 
/2 : R^ -» R' defined by f2(.xi, X2, X3) = (xi, X2, 0) , and 
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/3 : r3 ^ IR3 defined by f3(xu X2, X3) = (0, 0, 0) 

are all homomorphisms. Think of them all as "picture-taking" maps, or cameras. The first camera gives us a three-dimensional picture, the 
ideal, actually an isomorphism. The second gives us the usual two-dimensional picture, certainly something quite worthwhile. The third 
collapses the whole scene onto a point, a "black dot," which gives no idea of the original structure. Hence, the knowledge that two groups are 
homomorphic doesn't give complete information about the similarities in the structures of the two groups. For this reason, the term homomor- 
phic is rarely used (unlike isomorphic), and the functions, the homomorphisms, are studied. 

Definition: Homomorphism. Let [G, *] and [G', be groups. 6 : G -> G' is a homomorphism if 6{x * y) = 9(x) o 0(y) for all x, y eG. 

Example 154.8. Define a : Zg -» Z3 by a(n) = n(l), where n e Zg and n (1) is the sum of n ones in Z3. Therefore, a(0) = 0, a(l) = 1, 
a(2) = 2,a(3) = 1 + 1 + 1 =0,a(4) = l,anda(5) = 2.1fn, m eZg, 

a(n +6 m) = (n +6 m)(l) 
= n(l) +3 m(l) 
= a(n) +3 a(m) 

Theorem 15.4.2. A few properties of homomorphisms are that if 6 : G -» G' is a homomorphism, then: 

(a) e(e) = eadentityof G) = identity of G' = e'. 

(b) e(a -1) = 0(a)-' for all a e G. 

(c) IfH < G,then6»(H) = {0(/j) \h&H]<G'. 

Proof: 

(a) Let a be any element of G. Then 9(a) e G'. 

d{a)o e' = d(a) by the definition of e ' 

= 0{a *e) by the definition of e 
= 6{a) 06(e) by the fact that 6* is a homomorphism 

By cancellation, e ' = 6(e). 

(b) Again, let a &G. 

e' = 6(e) = 6(a*a-^) = 6(a)o6(a-^). 
Hence, by the uniqueness of inverses, 6(a) ~^ = 0(a~'). 

(c) Let hi, h2 e 6 (H). Then there exists a\, a2&H such that 6(a\) = hi , 6(02) = 62- Recall that a compact necessary and sufficient condition 
for H < Gisthatx*j ' <=H for all x, y eH. Now we apply the same fact in G' : 

bioh2-' = e(ai)'>e(a2)-' 

= 6(ai)oe(a2-') 
= 6(ar*a2-^)ee(H) 

since ai *a2 ' £ and so we can conclude that 0(H) < G' . ■ 

Corollary. Since a homomorphism need not he a surjection and part (c) of Theorem 15.4.2 is true for the case of H = G, the range offf, 
0(G), is a subgroup ofG' 

Example 15.4.9. If we define n : Z Z/4Z by ;r(n) = n + 4Z. then is a homomorphism. The image of the subgroup 4Z is the 
single coset + 4Z, the identity of the factor group. Homomorphisms of this type are called natural homomorphisms. The following 
theorems will verify that ;r is a homomorphism and also show the coimection between homomorphisms and normal subgroups. The reader 
can find more detail and proofs in most abstract algebra texts. 

Theorem 15 A3. If H < G, then the function tr:G-> GjH defined by 7r(a) = aH is a homomorphism, called the natural 
homomorphism. 

Based on Theorem 15.4.3, every normal subgroup gives us a homomorphism. 

Definition: Kernel. Let : G G' be a homomorphism, and let e' be the identity ofG'. The kernel of 6 is the set 

ker0 = {aeG\ 0(a) =e'} 

Theorem 15.4.4. Let : G -> G' be a homomorphism from G into G'. The kernel of 0is a normal subgroup ofG. 

Based on Theorem 15.4.4, every homomorphism gives us a normal subgroup. 

Theorem 15.4S : Fundamental Theorem of Group Homomorphisms. Let : G -> G' be a homomorphism. Then 0(G) is isomorphic 
to Glker 6. 

Example 15.4.10. Define : Z -» Zio by 6(n) = the remainder from dividing n by 10. The three previous theorems imply the 
following: 

(15.4.3) ;r : Z -» Z/ 10 Z defined by ;r(n) = n + 10 Z is a homomorphism. 
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(15.4.4) {neZ|e(n) = 0} = {10n|n eZ}= lOZ < Z. 

(15.4.5) Z/ 10 Z is isomorphic to Zio . 

Example 15.4.11. Let G be the same group of two by two invertible real matrices as in Example 15.4.6. Define O : G ^ G by 
0(A) = ——^ . We will let the reader verify that $ is a homomorphism. The theorems above imply: 

V IdetAI 

(15.4.4) kerO = {A | $(A) =/} = |^^ ^j|aeR, a:#:0|<G. This verifies our statement in Example 15.4.6. As in that example, 

letker<I) = Hi. 

(15.4.5) G/Ht is isomorphic to (A e G | det A = ± 1). 

(15.4.3) n : G — > G///i defined, naturally, by ;7i-( A) =AH\ is a homomorphism. 

For the remainder of this section, we will be examining certain kinds of homomorphisms that will play a part in our major application to 
homomorphisms, coding theory. 

Example 15.4.12. Consider 0) : Z2^ ^ Z2' defined by <I)(a, b) = (a, b, a +2 b). If(ai,ii), (02, ^2) e Z2^, 

0((ai,fci) + (a2, b2)) = <t>iai +202,^1 +2 ^2) 

= (ai +2 02, bi +2 &2 , ai +2 02 +2 ^"1 +2 ^2) 
= (ai, , oi +2 fc|) + (02, ^2 . 02 +2*2) 
= $(ai, ^)i) + 'I'(a2 , ^'2) 

Since $(a, 6) = (0, 0, 0) implies that a = and b = 0, the kernel of O is {(0, 0)}. By previous theorems, 

^{l.2^) = ((0, 0, 0), (1, 0, 1), (0, 1, 1), (1, 1, 0)) is isomorphic to Z22 . 

We can generalize the previous example as follows: If n, m > 1 and A an m x n matrix of O's and I's (elements of Z2), then (I> : Z2" -> Z2" 
defined by 

$(ai, a2, .... a„) = (oi, 02, .... a„)A 

is a homomorphism. This is true because matrix multiplication is distributive over addition. The only new idea here is that computation is 
done in Z2 where 1 +2 1 =0. If a = (ai, 02 , a„) and b = {bi, b2, . . . , b „), (a + b) A = a A + b A \s true by basic matrix laws. 
Therefore, $ (a + b) = ^{q) + <b{b). 

EXERCISES FOR SECTION 15.4 
A Exercises 

1 . Which of the following functions are homomorphisms? What are the kernels of those functions that are homomorphisms? 

(a) 61 : R* ^ R+ defined by Oiia) = \a\. 

if n is even 
I is odd 

(c) 63 : R X R ^ R, where 6*3(0, b) = a + b. 

(d) ^^4 : 54 ^ 54 defined by Odf) = f°f = f ■ 

2. Which of the following functions are homomorphisms? What are the kernels of those functions that are homomorphisms? 

(a) Ofi : yW2x2(R) R, defined by a 1 (A) = A11A22 + ^12^218- 

(b) 0-2 : (R*)^ -> R* defined by a2 (a, b) = ah. 

(c) 0-3 : (A e M2x2(R) | det A 0) ^ R*, where a3(A) = det A. 

(d) 0:4 : 54 -> 54 defined by a4(/) = /"^ 

3. Show that D4 has one proper normal subgroup, but that <(1, 4) (2, 3)> is not normal. 

4. Prove that the function $ in Example 15.4.1 1 is a homomorphism. 

5. Define the two functions a : 1-2 -» Z2^ and j8 : Z2^ -» Z2 by 

a(a\, 02, 03 ) = (fli, 0.2, 03 , a\ +2 02 +2 03), and 
Pibi, b2, bi, bi) = bi+b2 + bj, + bA 
Describe the function /Jo a. Is it a homomorphism? 

6 . Express $ in Example 1 5 .4 . 1 2 in matrix form . 



(0 if n ii 

(b) 62 : ^8 ^ ^2 where e2(n) = , . . . 

11 It ni 
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B Exercises 

7. Prove that if G is an abelian group, then q (x) = defines a homomorphism from G into G. Is q ever an isomorphism? 

8. Prove that ife : G ^ G' is a homomorphism, and //< C, then (? (//) < 6 (G). Is it also true that (?(//) < G'? 

9. Prove that ife : G ^ G' is a homomorphism, and H' < (9 (G), then r '(//') = |a e G | e(a)eH']< G. 

C Exercises 

10. Following up on Example 1 1 .4.6, prove that A5 is a simple group; i. e., it has no proper normal subgroups. 

(a) Make a list of the different cycle structures that occur in A5 and how many elements have those structures. 

(b) Within each set of permutations with different cycle structures, identify which subsets are closed with respect to the conjugation operation. 
With this you will have a partition of A5 into conjugate classes where for each class C, 

f, ge C if and only if 3(f) eA^ such that cl}^^ofo(p = g 

(c) Use the fact that a normal subgroup of A5 needs to be a union of conjugate classes and verify that no such union exists. 



1 15.5 Coding Theory— Group Codes 



In this section, we will introduce the basic ideas involved in coding theory and consider solutions of a coding problem by means of group codes. 

A Transmission Problem. Imagine a situation in which information is being transmitted between two points. The information takes the form 
of high and low pulses (for example, radio waves or electric currents), which we will label 1 and 0, respectively. As these pulses are sent and 
received, they are grouped together in blocks of fixed length. The length determines how much information can be contained in one block. If 
the length is r, there are 2^ different values that a block can have. If the information being sent takes the form of text, each block might be a 
character. In that case, the length of a block may be seven, so that 2^ = 128 block values can represent letters (both upper and lower case), 
digits, punctuation, and so on. Figure 15.5.1 illustrates the problem that can be encountered if information is transmitted between two points. 
During the transmission of data, noise can alter the signal so that what is received differs from what is sent. 



001101 
is sent 




Noise 





0001001 
is received 


>- 





Figure 15.5.1 
A noisy transmission 

Noise. Noise is a fact of life for anyone who tries to transmit information. Fortunately, in most situations, we could expect a high percentage 
of the pulses that are sent to be received properly. However, when large numbers of pulses are transmitted, there are usually some errors due to 
noise. For the remainder of the discussion, we will make assumptions about the nature of the noise and the message that we want to send. 
Henceforth, we will refer to the pulses as bits. 

Diiiaiy 
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Figure 15.5.2 
The Coding Process 



Message 
if all goes right 



Binary Symmetric Chiannels 

We will assume that our information is being sent along a binary symmetric channel. By this we mean that any single bit that is transmitted will 
be received improperly with a certain fixed probability,/?. The value of p is usually quite small. To illustrate the process, we will assume that 
p = 0.001, which, in the real world, would be considered somewhat large. Since 1 - p = 0.999, we can expect 99.9% of all bits to be 
properly received. 

Suppose that our message consists of 3,000 bits of information, to be sent in blocks of three bits each. Two factors will be considered in 
evaluating a method of transmission. The first is the probability that the message is received with no errors. The second is the number of bits 
that will be transmitted in order to send the message. This quantity is called the rate of transmission: 



Rate : 



Message length 
Number of bits Iransmilted 
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As you might expect, as we devise methods to improve the probabUity of success, the rate will decrease. 
Case 1: Raw information. Suppose that we ignore the noise and transmit the message "as is." The probabihty of success is 
0.9993000 ^ 0.0497124 

Therefore we only successfully receive the message totally correct less than 5% of the time. The rate of 3000/3000 = 1 certainly doesn't offset 
this poor probabihty. 

The Coding Process 

Our strategy for improving oiu" chances of success wiU be to send an encoded message across the binary symmetric chaimel. The encoding wiU 
be done in such a way that small errors can be identified and corrected. This idea is illustrated in Figure 15.5.2. 

In our examples, the functions that wiU correspond to our encoding and decoding devices wiU all be homomorphisms between Cartesian 
products of Z2. 

Case 2: An Error-Detecting Code. Suppose that each block of three bits a = (a 1 , 02, a 3 ) is encoded according to the function 

e: ^Z2'*4 , 

where 

e(a) = (ai, 02, 03. ai +202+203)- 

When the encoded block is received, the first three bits are probably part of the message (it is correct approximately 99.7% of the time), but the 
added bit that is sent will make it possible to detect single errors in the block. Note that when e (a) is transmitted, the sum of its components is 

a 1+2 a 2 +2 a 3 +2 ( oi +2 02 +2 03) = 
since a, +ai=0inZ2. 

If any single bit is garbled by noise, the sum of the received bits will be 1 . The last bit of e (a) is called the parity bit. A parity error occurs if the 
sum of the received bits is 1 . Since more than one error is unhkely when p is small, a high percentage of all errors can be detected. 
At the receiving end, the decoding function acts on the four-bit block b = (bi, b 2 , b^, b^) according to 

d(,b) = {b\,b2, bi, 61 +2^2 +2 +2 b^). 

The fourth bit is called the parity-check bit. If no parity error occurs, the first three bits are recorded as part of the message. If a parity error 
occurs, we wiU assume that a retransmission of that block can be requested. This request can take the form of automatically having the parity- 
check bit of d{b) sent back to the source. If 1 is received, the previous block is retransmitted; if is received, the next block is sent. This 
assumption of two-way communication is significant, but it is necessary to make this coding system useful. It is reasonable to expect that the 
probability of a transmission error in the opposite direction is also 0.001. Without going into the details, we will report that the probability of 
success is approximately 0.990 and the rate is approximately 3/5. The rate includes the transmission of the parity-check bit to the source. 

Case 3: An Error-Correcting Code. For our final case, we will consider a coding process that can correct errors at the receiving end so that only 
one-way communication is needed. Before we begin, recall that every element of Z2'', n > 1, is its own inverse; that is, -b = b. Therefore, 
a - b = a + b. 

The three-bit message blocks are difficult to transmit because they are so similar to one another. If a and are in Z2^, their difference, a +2 b, 
can be thought of as a measure of how close they are. If a and b differ in only one bit position, one error can change one into the other. The 
encoding that we wiU introduce takes a block a = (a 1 , 02, 03 ) and produces a block of length 6 called the code word of a . The code words are 
selected so that they are farther from one another than the messages are. In fact, each code word wiU differ from each other code word by at 
least three bits. As a result, any single error wiU not push a code word close enough to another code word to cause confusion. Now for the 
details. Let 

n 1 1 ^ 
G= 1 1 1 
,001011 
be the generator matrix for the code, and 

a = (ai, 02, 03 ) 

Define e : l.-^ ^ Z2^ by 

e(a) = aG = (ai, 02, 03, 04, 05, ag) 



where 



04 = a\ +2 02 

05 = ai +2 03 
ae = 02 +2 03 
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Notice that e is a homomorphism. If a and b are distinct elements of then c = a + b has at least one coordinate equal to 1. Now consider 
the difference between e (a) and e (b): 

e{a) + e[b) = e(a + b) 

= e(c) 

= (ci, C2, C3. C4, C5. ce) 

Whether c has 1, 2, or 3 ones, e (c) must have at least three ones; therefore e (a) and e (b) differ in at least three bits. 

Now consider the problem of decoding the code words. Imagine that a code word, e(a), is transmitted, and b = (_bi, b2, b^, 64, ^5, b^) is 
received. At the receiving end, we know the formula for e (a), and if no error has occurred in transmission, 

bi = ai 



b2 = a2 
bi = 03 
b4 = a\ +2 ai 
bi = ai +2 03 

hi = +2 <^3 



h\ +2 ^2 +2 ''4 = 
=> b[ +2 hj +2hj = 
b2 +2 bi +2 &6 = 



The three equations on the right are called parity-check equations. If any of them is not true, an error has occurred. This error checking can be 
described in matrix form. Let 

/" 1 1 ^ 

1 1 

1 1 

1 
1 

lo 1 ; 



2^ by p (b) = bP. We call p (b) the syndrome of the received block. 



P is called the parity-check matrix for this code. Now define p : Z.2^ 
For example, 

p(0, 1, 0, 1, 0, 1) = (0, 0, 0) and p(l, 1, 1, 1, 0, 0) = (1, 0, 0) 

Note that p is also a homomorphism. If the syndrome of a block is (0, 0, 0), we can be almost certain that the message block is (bi , b2, 63). 

Next we turn to the method of correcting errors. Despite the fact that there are only eight code words, one for each three-bit block value, the set 
of possible received blocks is Z2', with 64 elements. Suppose that b is not a code word, but that it differs from a code word by exactly one bit. 
In other words, it is the result of a single error in transmission. Suppose that w is the code word that b is close to and that they differ in the first 
bit. Then 

b + w = (1,0, 0, 0, 0, 0) 

and 

pib) = p{b) + p(yv) since p (w) = (0, 0, 0) 
= p{b + w) since p is a homomorphism 

= p(l, 0, 0, 0, 0, 0) 
= (1, 1,0) 

Note that we haven't specified b or w, only that they differ in the first bit. Therefore, if b is received and p{b) = {i, 1, 0), the transmitted code 
word was probably b + (I, 0, 0, 0, 0, 0) and the message block was {bi +2 1, b2, b^). The same analysis can be done if b and w differ in 
any of the other five bits. 

This process can be described in terms of cosets. Let W be the set of code words; that is, W = e(Z2^ ).W is a subgroup of Z2*. Consider the 
factor group Z2'/ W: 



IZzV^I 



M 



Suppose that bi and h2 are representatives of the same coset. Then bi = b2 + w for some w in W. Therefore, 

p(b 1) = p(bi) + p (w) since p (w) = (0, 0, 0) 
= p(bi + w) 

= P(h2) 

and so hi and ^2 have the same syndrome. 

Finally, suppose that di and d2 are distinct and both have only a single coordinate equal to 1. Then di + d2 has exactly two ones. Note that the 
identity of Z2*, (0, 0, 0, 0, 0, 0), must be in W. Since di + d2 differs from the identity by two bits, di +d2i W. Hence di and d2 belong to 
distinct cosets. The reasoning above serves as a proof of the following theorem. 
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Theorem 15 J. 1. There is a system of distinguished representatives of Z-f j W such that each of the six-bit blocks having a single 1 is a 
distinguished representative of its own coset. 

Now we can describe the error-correcting process. First match each of the blocks with a single 1 with its syndrome. In addition, match the 
identity of W with the syndrome (0, 0, 0) (see Table 15.5.1). Since there are eight cosets of W, select any representative of the eighth coset to be 
distinguished. This is the coset with syndrome (1,1,1). 

Syndrome Error Correction 

000 000000 

1 1 1 
1 1 1 
Oil 1 
1 1 
1 1 
1 1 
111 1 1 



Table 15.5.1 
Error Correction Table 



When block b is received, you need only: 



( 1 ) Compute the syndrome ,p{b), and 

(2) Add to b the error correction that matches p (b) . 

We win conclude this example by computing the probabihty of success for our hypothetical situation. It is 

(0.999« + 6x0.999' xO.OOl)^"*"' = 0.985151 . 
The rate for this method is r . 



EXERCISES FOR SECTION 15.5 
A Exercises 

1 . If the error-detecting code is being used, how would you act on the following received blocks? 

(a) (1, 0, 1, 1) 

(b) (1, 1, 1, 1) 

(c) (0, 0, 0, 0) 

2. Express the encoding and decoding functions for the error-detecting code using matrices. 

3 . If the error-correcting code is being used, how would you decode the following blocks? Expect a problem with one of these. Why? 

(a) (1,0,0,0, 1, 1) 

(b) (1, 0, 1, 0, 1, 1) 

(c) (0, 1, 1, 1, 1,0) 

(d) (0,0,0, 1, 1,0) 

4. Describe how the triple-repetition code with encoding function, e : Z2 -> Z2^, where e(a\) = (ai, ai, ai) can allow us to correct a single 
error. What is the probabihty of success for the p = 0.001 , 3000-bit situation? What are the generator and parity-check matrices for this code? 

B Exercise 
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5 . Consider the linear code defined the generator matrix 



(a) What size blocks does this code encode and what is the length of the code words? 

(b) What are the code words for this code? 

(c) With this code, can you detect single bit errors? Can you correct all, some, or no single bit errors? 

6. Rectangular codes. To build a rectangular code, you partition your message into blocks of length m and then factor m into k] ■k2 and 
arrange the bits in a by ^2 rectangular array as in the figure below (read "digit" as "bit"). Then you add parity bits along the right side and 
bottom of the rows and columns. The code word is read row by row. 

For example, if m is 4, then our only choice is a 2 by 2 array. The message 1 101 would be encoded as so 



1 


1 








1 


1 


1 








And the code word is the string 1 1001 1 10. 

(a) Suppose that you were sent four bit messages using this code and your received the following strings. What were the messages. 

(i) 11011000 

(ii) 01110010 

(iii) 10001111 

(b) If you encoded bits in this manner, what would be the rate of the code? 

(c) Rectangular codes are linear codes for the 3 by 2 rectangular code, what are the generator and parity check matrices? 



■ = message bit 
□ = parity bit 
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I SUPPLEMENTARY EXERCISES FOR CHAPTER 15 

Section 15.1 

1 . How does one find all subgroups of any cyclic group? Can this same process be used to determine all subgroups of noncycUc groups? 

2. Exercises of Section 15.1 tells us that Z2 x Z5 is isomorphic to Ziq. Use the Chinese Remainder Theorem to find an isomorphism between 
these two groups, 

3. Use the Chinese Remainder Theorem to add 74 and 85 in Z120. 
Section 15.2 

4. Let G be a group and assume |G| = 10. Can C have subgroups of order 2? ...of order 3? ... of order 4? Explain. 

5. List all left cosets of H = [0, 4, 8} in the group Z12 and write out the table for Zi2//^. 

6. Let G be a finite group of order n. Then for any a g 0,0" = e, where e is the identity of G. Interpret this statement for the groups [Zg, +§] 
and [{/(Zft), XfiJ 

7. (a) Consider Zg/(2). How many distinct left cosets of (2> in Zg are there? List them. 

(b) Repeat part a for Z 1 2 / <2) . 

(c) Is Z8/<2> isomorphic to Zi2/(2>? Explain. 

Section 15.3 

8. Determine all proper subgroups of the symmetric group S3 and draw a Hasse diagram for the relation "is a subset of." 

9. Let / e 5„. Prove that/is even if and only if /"' is even. 

10. (a) By analogy with the motions of a square, how many motions of a cube are there? 
(b) Design a "package-facing" machine using the group of motions of the cube. 

Section 15.4 

11. (a) Let [Bi, -1, Vi , Ai] and [82,-2, ^2, ^2] be Boolean algebras. Define a Boolean algebra homomorphism based on the definition of 
a group homomorphism. 

(b) Your definition in part a should result in properties similar to the ones of a group homomorphism. Let / : Bi -» B2 be a Boolean 
algebra homomorphism. Prove: 

(i) /(Oi) = O2 and/(li)= I2 

(ii) a < 6 => /(a) < f(b) V a, 6 e Bi and 

(iii) f(Bi) is a Boolean subalgebra of B2. 

12. (a) Prove the contentions of example 15.4.6 that Hi is a normal subgroup of GL(2, R) but that H2 is not. 

(b) In order to get a clearer picture of what GL (2, R)/SL (2, R) is, prove that the determinant function det : GL(2, R) -» R* is an onto 
homomorphism, and apply Theorem 15.4.5. 
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Section 1 5.5 

13. This exercise concerns a code called the Hamming (7, 4) code, an error-correcting code with rate 4/7. A four by seven generator matrix G 
encodes message blocks of length 4 according to the rule e(a) = a G, so that the parity check matrix for the code is 



r 1 


1 


^ 


1 





1 





1 


1 


1 


1 


1 


1 











1 





.0 





1 ; 



Thatis, fcisacode word iff foP =(0 ). 

(a) FindG. 

(b) Encode 1111 and 1001. 

(c) Compute the syndrome of the following received message blocks and correct them, if necessary: 

(i) 0100000 (ii) 1010101 (iii) 1011011. 

(d) Prove that this code does indeed correct all single bit errors . 

14. Given a code with parity check matrix P whose transpose is given below, identify the generator matrix, and the rate of the code. Prove that 
the code corrects all single errors. 



i 1 


1 





^ 


1 





1 





1 








1 





1 


1 








1 





1 








1 


1 


1 


1 


1 





1 


1 





1 


1 





1 


1 





1 


1 


1 


1 


1 


1 


1 


1 














1 














1 





.0 








1 ; 
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An Introduction to Rings and Fields 



GOALS 

In our early elementary school days we began the study of mathematics by learning addition and multiplication on the set of positive integers. 
We then extended this to operations on the set of all integers. Subtraction and division are defined in terms of addition and multiplication. Later 
we investigated the set of real numbers under the operations of addition and multiplication. Hence, it is quite natural to investigate those 
structures on which we can define these two fundamental operations, or operations similar to them. The structures similar to the set of integers 
are called rings, and those similar to the set of real numbers are called fields. 

In coding theory, highly structured codes are needed for speed and accuracy. The theory of finite fields is essential in the development of many 
structured codes. We will discuss basic facts about finite fields and introduce the reader to polynomial algebra. 

1 16.1 Rings— Basic Definitions and Concepts 

As mentioned in our goals , we would like to investigate algebraic systems whose structure imitates that of the integers . 

Definition: Ring. A ring is a set R together with two binary operations, addition and multiplication, denoted by the symbols + and ■ such 
that the following axioms are satisfied: 

(1) [R, +] is an abelian group . 

(2) Multiplication is associative on R. 

(3) Multiplication is distributive over addition; that is, for all a,b,ce R, the left distributive law, a(b + c) = ab + ac, and the right distribu- 
tive law, (b + c)a - ba + ca, hold. 

Comments: 

(1) A ring is designated as [R, +, •] or as just plain R if the operations are understood. 

(2) The symbols + and • stand for arbitrary operations, not just "regular" addition and multiplication. These symbols are referred to by the 
usual names . For simplicity , we will write a b instead of a • if it is not ambiguous . 

(3) For the abelian group [R, +], we use additive notation. In particular, the group identity is designated by rather than by e and is customar- 
ily called the "zero" of the ring. The group inverse is also written in additive notation: -a rather than a"' . 

We now look at some examples of rings. Certainly all the additive abelian groups of Chapter 11 are likely candidates for rings. 

Example 16.1.1. [Z, +, •] is a ring, where + and ■ stand for regular addition and multiplication on Z. From Chapter 11, we already 
know that [Z, +] is an abelian group, so we need only check parts 2 and 3 of the definition of a ring. From elementary algebra, we know that 
the associative law under multiplication and the distributive laws are true for Z. This is our main example of an infinite ring. 

Example 16.1.2. [Z„, +„, x„] is a ring. The properties of modular arithmetic on Z.„ were described in Section 1 1 .4, and they give us the 
information we need to convince ourselves that [Z„, +„, x„] is a ring. This example is our main example of finite rings of different 
orders. 
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Definition: Commutative Ring. A ring in which the commutative law holds under the operation of multiplication is called a commuta- 
tive ring. 

It is common practice to use the word abelian when referring to the commutative law under addition and the word commutative when referring 
to the commutative law under the operation of multiplication. 

Definition: Unity. A ring YR, + , •] that has a multiplicative identity is called a ring with unity. The multiplicative identity itself is called 
the unity of the ring. More formally, if there exists an element in R, designated by I, such that for all X eR,x-\ = I -x = x, then R is called 
a ring with unity. 

Example 16.13. The rings in Examples 16.1.1 and 16.1.2 are commutative rings with unity, the unity in both cases being the number 1. 
The ring [M2x2(R), + , •] is a noncommutative ring with unity, the unity being the identity matrix ^ = ^ q ^ j • 

DIRECT PRODUCTS OF RINGS 

Let i?i,i?2> •••> be rings under the operations +i, +2 ,...,+„ and -i, -2, respectively. Let 

n 

P= xRi 

i=l 

anda = {ai, 02, a„),b = (bi, b^, b„)&P. 

From Chapter 1 1 we know that P is an abelian group under the operation of componentwise addition: 

a + b = (ai +1 bi, 02 +2 ^2 , • • • , +n K)- 
We also define multiplication on P componentwise: 

a - b = {ai -ibu a2-2b2 a„ •„ fe„). 

To show that P is a ring under the above operations, we need only show that the (multiplicative) associative law and the distributive laws hold. 
This is indeed the case, and we leave it as an exercise. If each of the if,- is commutative, then P is commutative, and if each contains a unity, 
then P is a ring with unity, which is the n - tuple consisting of the unities of each of the Ri' s. 

Example 16.1.4. Since [Z4, +4, X4] and [Z3, +3, X3] are rings, then Z4XZ3 is a ring, where, for example, 
(2, 1) + (2, 2) = (2 +4 2, 1+3 2) = (0, 0) 

and 

(3, 2) -(2, 2) = (3 X42, 2 X32) = (2, 1). 
To determine the unity, if it exists, in the ring Z4XZ3, we look for the element (m, n) such that for all elements (a:, y) eZ4XZ3, 

{x, y) ={x, y)-(m, n) = (m, n)-(x, y), 
or, equivalently, 

(x X4 m, y X3 n) = (m X4 x, n X3 j) = (x, y). 

So we want m such that xX4m = mX4 x = x in the ring Z4. The only element m in Z4 that satisfies this equation is m = 1. Similarly, we 
obtain a value of 1 for n. So the unity of Z4 x Z3 , which is unique by Exercise 15 of this section, is (1, 1). We leave to the reader to verify 
that this ring is commutative. 

Hence, products of rings are analogous to products of groups or products of Boolean algebras. We now consider the extremely important 
concept of multiplicative inverses. Certainly many basic equations in elementary algebra (e.g., 2x = 3) are solved with this concept. We 
introduce the main idea here and develop it more completely in the next section. 

Example 16.1.5. The equation 2x = 3 has a solution in the ring [R, +, •] but does not have a solution in [Z, +, ■], since, to solve this 
equation, we multiply both sides of the equation 2x = 3 by the multiplicative inverse of 2. This number, 2"' exists in IR but does not exist in 
Z . We formalize this important idea in a definition which by now should be quite familiar to you. 

Definition: Multiplicative Inverses. Let [R, +, ■] be a ring with unity, 1. If u e R and there exists an element v e R such that 
u-v = v-u = 1, then u is said to have a multiplicative inverse, v. We call a ring element that possesses a multiplicative inverse a unit of the 
ring. The set of all units of a ring R is denoted by U(R). 

By Theorem 11.3.2, the multiplicative inverse of a ring element is unique, if it exists. For this reason, we can use the notation for the 
multiplicative inverse of m, if it exists. 

Example 16.1.6. In the rings [R, +, •] and [Q, +, •] every nonzero element has a multiplicative inverse. The only elements in Z that have 
multiplicative inverses are -1 and 1. That is, C/(R) = R*,C/(Q) = Q*,andC/(Z) = (-1, 1). 

Example 16.1.7. Let us find the multiplicative inverses, when they exist, of each element of the ring [Zf, +6 , x^]. If m = 3, we want an 
element v such that ux^v = 1. We do not have to check whether vX(,u= 1 since Zj is commutative. If we try each of the six elements, 0, 1, 
2, 3, 4, and 5, of Zg, we find that none of them satisfies the above equation, so 3 does not have a multiplicative inverse in Zg . However, 
since 5 XgS = 1,5 does have a multiplicative inverse in Zg , namely itself: 5~' = 5. The following table summarizes all results for Zg . 
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u u 

does not exist 

1 1 

2 does not exist 

3 does not exist 

4 does not exist 

5 5 

It shouldn't be a surprise that the zero of a ring is never going to have a multiplicative inverse except in the trivial case of /? = (0). 

Isomorphism is a universal concept that is important in every algebraic structure. Two rings are isomorphic as rings if and only if they have the 
same cardinality and if they behave exactly the same under corresponding operations. They are essentially the same ring. For this to be true, 
they must behave the same as groups (under + ) and they must behave the same under the operation of multiplication. 

Definition: Ring Isomorphism. Let [R, +, •] and [R\ +', ■'] be rings. Then R is isomorphic to R' if and only if there exists a 
map,f : R -> R\ called a ring isomorphism, such that 

(1) f is one-to-one and onto, 

(2) f (a -h b) = f{a) + ' f (b) for all a, b e R,and 

(3) f(a-b) = f(a) ■ 'fib) for all a.b e R. 

Conditions 1 and 2 tell us that/is a group isomorphism. Therefore, to show that two rings are isomorphic, we must produce a map, called an 
isomorphism, that satisfies the definition. Sometimes it is quite difficult to find a map that works. This does not necessarily mean that no such 
isomorphism exists, but simply that we cannot find it. 

This leads us to the problem of how to show that two rings are not isomorphic. This is a universal concept. It is true for any algebraic structure 
and was discussed in Chapter 1 1 . To show that two rings are not isomorphic, we must demonstrate that they behave differently under one of the 
operations. We illustrate through several examples. 

Example 16.1.8. Consider the rings [Z, +, ■] and [2Z, +, ■]• In Chapter 11 we showed that as groups, the two sets Z and 2Z with 
addition were isomorphic. The group isomorphism that proved this was the map / : Z -> 2Z, defined by f(n) = 2n. Is /a ring isomor- 
phism? We need only check whether /(m-n) = /(m)-/(n) for all m, n eZ: 

f(m -n) = 2- m-n and 

f(m) -fin) =2m-2n = 4-m -n 

Therefore, /is not a ring isomorphism. This does not necessarily mean that the two rings Z and 2Z are not isomorphic, but simply that the/ 
doesn't satisfy the conditions. We could imagine that some other function does. We could proceed and try to determine another function/ fo 
see whether it is a ring isomorphism, or we could try to show that Z and 2Z are not isomorphic as rings. To do the latter, we must find 

something different about the ring structure of Z and 2Z. 

We already know that they behave identically under addition, so if they are different as rings, it must have something to do with how they 
behave imder the operation of multiplication. Let's begin to develop a checklist of how the two rings could differ: 

(1) Do they have the same cardinality? Yes, they are both coimtable. 

(2) Are they both commutative? Yes. 

(3) Are they both rings with unity? No. 

Z is a ring with unity, namely the number 1 . 2Z is not a ring with unity, 1 ^ 2 Z. Hence, they are not isomorphic as rings. 

Example 16.1.9. Next consider whether [2Z, +, -J and [3 Z, +, -J are isomorphic. Because of the previous example, we might guess 
that they are not. However, checklist items 1 through 3 above do not help us. Why? We add another checklist item: 

(4) Find an equation that makes sense in both rings, which is solvable in one and not the other. 

The equation x + x = x ■ x, or 2x = x^, makes sense in both rings. However, this equation has a nonzero solution, x = 2, in 2Z, but does 
not have a nonzero solution in 3Z. Thus we have an equation solvable in one ring that cannot be solved in the other, so they cannot be 
isomorphic. 

Another universal concept that applies to the theory of rings is that of a subsystem. A subring of a ring [R, +, •] is any nonempty subset S of R 
that is a ring under the operations of R. First, for S to be a subring of the ring R, S must be a subgroup of the group [R, +]. Also, 5 must be 
closed under •, satisfy the associative law (under •), and satisfy the distributive laws. But since R is a ring, the associative and distributive laws 
are true for every element in R, and, in particular, for all elements in S, since S qR. We have just proven the following theorem: 

Theorem 16.1.1. A subset S of a ring [R, + , •] is a subring of R if and only if: 
(1) [5, +] is a subgroup of the group [R, +], which by Theorem 115.1, means we must show: 

(a) If a, b e S, then a -I- b £ S, 

(b) eS,and 
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(c))Ifa £ S, then -a e S. 

(2) S is closed under multiplication: if a, b e S, then a ■ b e S. 

Example 16.1.10. The set of even integers, 2 Z, is a subring of the ring [Z, +, •] since [2 Z, +] is a subgroup of the group [Z, +] and 
since it is also closed with respect to multipUcation: 

2m, 2n e 2Z => (2m)-(2n) = 2(2-m-n) e 2Z. 
Several of the basic facts that we are famUiar with are true for any ring. The following theorem Usts a few of the elementary properties of rings. 
Theorem 16.12. Let [R, +, -] be a ring, with a,b eR. Then 

(1) a -0 = ■ a = 

(2) a-(-b) = (-a) -b = -(a-b) 

(3) (-a)-(-b) =a-b 
Proof of Part 1: 

a • = a • (0 + 0) 

= a ■ + a ■ by the left distributive law. 

Hence if we add -{a ■ 0) to both sides of the above, we obtain a • = 0. Similarly, we can prove that ■ a = 0. 

Proof of Part 2: Before we begin the proof of part 2, recall that the inverse of each element of the group [R, +] is unique. Hence the inverse 
of the element a ■ b\s unique and it is denoted -(a • b). 

Therefore , to prove that a • ( — fc) = - (a • fc) , we need only show that a-(—b) inverts a-b. 

a-{-b) + a-b = a-(-b + b) by the distributive axiom 
= a • since - b inverts b 

= by part 1 of this theorem 

Similarly, it can be shown that (-a) ■ h = -(a ■ b). This completes the proof of part 2. 
We leave the proof of part 3 to the reader (see Exercise 16 of this section). ■ 
Example 16.1.11. We will compute 2 -(-2) in the ring [Zg, +§, Xg]. 

2X6 (-2) = -(2X6 2) = -4 = 2, 
since the additive inverse of 4 (mod 6) is 2. Of course, we could have done the calculation directly as 

2 Xfi (-2) = 2 X64 = 2. 

As the example above illustrates. Theorem 16.1.2 is a modest beginning in the study of which algebraic manipulations are possible in the 
solution of problems in rings. A fact in elementary algebra that is used frequently in problem solving is the cancellation law. We know that the 
cancellation laws are true under addition for any ring (Theorem 1 1 .3.5). 

Are the cancellation laws true under multiplication? More specifically, let [R, +, •] be a ring and let a, h, c e R with a. t 0. When can we 
cancel the a's in the equation a-b = a - cl We can certainly do so if a ' exists, but we cannot assume that a has a multiplicative inverse. The 
answer to this question is found with the following definition and Theorem 16.1.3. 

Definition: Divisors of Zero. Let [R, +, •] be a ring. If a and b are two nonzero elements ofR such that a-b = 0, then a and b are 
called divisors of zero. 

Example 16.1.12 (a) In the ring [Zg, +8, Xg], the numbers 4 and 2 are divisors of zero since 4 X82 = 0. In addition, 6 is a divisor of 

zero because 6 Xg 4 = 0. 

(b) In the ring [M2x2('R), +, -J the matrices A = ^ *^ j and ^ = ^ q q ) divisors of zero since AB = 0. 

Example 16.1.13. [Z, +, •] has no divisors of zero. 
Now here is why divisors of zero are related to cancellation. 

Theorem 16.13. The (multiplicative) cancellation law holds in a ring [R, +, ■] if and only if R has no divisors of zero. 

We prove the theorem using the left cancellation law, namely that if a :/= and a-b = a - c, then b = c for all a, b, c e R. The proof is 
similar using the right cancellation law. 

Proof: (=>) Assume the left cancellation law holds in R and assume that a and b are two elements in R such that a-b = 0. We must show that 
either a = or = 0. To do this, assume that a + and show that b must be 0. 

a-fo = => a-b = a-0 by Theorem 16.2. 1 , part 1 
b = by the cancellation law 
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(<=) Conversely, assume that R has no divisors of and we will prove that the cancellation law must hold. To do this, assume that a, b, c e R, 
a * 0, such that a ■ b = a ■ c and show that b = c. 

a ■ b = a ■ c =^ a ■ b - a ■ c = Why ? 
=> a-(b-c) =0 Why? 
^ b-c = Why 

^ b = c m 

Hence, the only time that the cancellation laws hold in a ring is when there are no divisors of zero. The commutative rings with imity in which 
the above is true are given a special name. 

Definition: Integral Domain. A commutative ring with unity containing no divisors of zero is called an integral domain. 
In this chapter. Integral domains wiU be denoted genericaUy by the letter D. 
We state the following two useful facts without proof. 

Theorem 16.1.4. The element m in the ring Z„ is a divisor of zero if and only if mis not relatively prime to n (i.e., gcdim, ri) 1). 

Corollary. Ifp is a prime, then Zp has no divisors of zero. 

Example 16.1.14. [Z, +, •]. [^p . +p, Xp] with p a prime, [Q, +, •], [R, +, •], and [C, +, •] are all integral domains. The key 
example of an infinite integral domain is [Z, +, •]• In fact, it is from Z that the term integral domain is derived. The main example of a finite 
integral domain is [Zp , +p , Xp ], when p is prime. 

We close this section with the verification of an observation that was made in Chapter 1 1 , namely that the product of two algebraic systems 
may not be an algebraic system of the same type. 

Example 16.1.15. Both [Z2 , +2, X2jand[Z3, +3, X3 J are integral domains. Consider the product Z2XZ3. It's true that Z2XZ3 is 
a commutative ring with unity (see Exercise 13). However, (1, 0)- (0, 2) = (0, 0), so Z2XZ3 has divisors of zero and is therefore not an 
integral domain. 

EXERCISES FOR SECTION 16.1 
A Exercises 

1. Review the definition of rings to show that the following are rings. The operations involved are the usual operations defined on the sets. 
Which 

of these rings are commutative? Which are rings with unity? For the rings with unity, determine the unity and all units. 

(a) [Z, +, •] 

(b) [C, +, •] 

(c) [M„x„(R), +, •] 

(d) [Q, +, •] 

(e) [M2x2(R), +, •] 

(f) [Z2, +2, X2] 

2. Follow the instructions for Exercise 1 and the following rings: 

(a) [Zg, +6, Xg] 

(b) [Z5, +5, X5] 

(c) [^2\ +, ■] 

(d) [Zg , +8 , Xg ] 

(e) [Z X Z, +, •] 

(f) [R\ +, •] 

3. Show that the following pairs of rings are not isomorphic: 

(a) [Z, +, •] and [M2x2(Z), +, ■] 

(b) [3Z, +, -land [4 Z, +, •]• 

4. Show that the following pairs of rings are not isomorphic: 
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(a) [IR, +, -landK, +, •]• 

(b) [Z2 XZ2, +, •]and[Z4, +, •]. 

5. (a) Show that 3Z is a subring of the ring [Z, +, ■] 

(b) Find all subrings of Zg. 

(c) Find all subrings of Z2 x Z2 . 

6. Verify the vaUdity of Theorem 16.1 .3 by finding examples of elements a, b, and c (a 0) in the following rings, where a ■ b = a ■ c and yet 
b + c\ 

(a) Zg 

(b) M2x2(IR) 

(c) Z22 

7. (a) Determine all solutions of the equation - 5 x + (> = m J.. Can there be any more than two solutions to this equation (or any 
quadratic equation) in Z? 

(b) Find all solutions of the equation in part a in Z12. Why are there more than two solutions? 

8. Solve the equation j;^+4j; + 4 = 0inthe following rings. Interpret 4 as 1 + 1 + 1 + 1, where 1 is the unity of the ring. 

(a) in Zg 

(b) inM2x2(IR) 

(c) in Z 

(d) in Z3 

B Exercises 

9. The relation "is isomorphic to" on rings is an equivalence relation. Explain the meaning of this statement. 

1 . Let i? 1 , i?2 > • • • > be rings . Prove the multipUcative , associative , and distributive laws for the ring 

R= xRi 

(=1 

(a) If each of the Ri is commutative, is R commutative? 

(b) Under what conditions wiU if be a ring with unity? 

(c) What win the units of if be when it has a unity? 

11. (a) Prove that the ring Z2 x Z3 is commutative and has unity. 

(b) Determine all divisors of zero for the ring Z2 x Z3 . 

(c) Give another example illustrating the fact that the product of two integral domains may not be an integral domain. Is there an 
example where the product is an integral domain? 

12. Boolean Rings. Let Ubea nonempty set. 

(a) Verify that [P(_U), ffi, fl is a commutative ring with unity. 

(b) What are the units of this ring? 

13. (a) For any ring [if , +, •], expand (a + b)(c + d)fora, b, c, d & R. 

(b) If R is commutative, prove that (a + hf- = + lab + for all a, b e R. 

14. (a) Let R be a commutative ring with unity. Prove by induction that for n > 1 , 

(a + 6)"= i (" la*/)"-* 

(b) Simplify (a + bf in Z5 . 

(c) Simplify (a + bf° in Ziq. 
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15. Prove: If i? is a ring with unity then this unity is unique. 

16. Prove part 3 of Theorem 16.1 .2. 

1 7 . Prove the Corollary to Theorem 1 6 . 1 .4 . 

18. Let i/ be a finite set. Prove that the Boolean ring [PCi/), ffi, ni is isomorphic to the ring [Z2", +, -J-wheren =\U\ 
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1 16.2 Fields 

Although the algebraic structures of rings and integral domains are widely used and play an important part in the applications of mathematics, 
we still cannot solve the simple equation ax = b, a + in all rings or in all integral domains. Yet this is one of the first equations we learn to 
solve in elementary algebra and its solubility is basic to innumerable questions. Certainly, if we wish to solve a wide range of problems in a 
system we need at least aU of the laws true for rings and the cancellation laws together with the ability to solve the equation ax = b, a =£ 0. We 
summarize the above in a definition and Ust several theorems without proof that will place this concept in the context of the previous section. 

Definition: Field. Afield is a commutative ring with unity such that each nonzero element has a multiplicative inverse. 
In this chapter, we denote a field genericaUy by the letter F. The letters k, K and L are also conventionally used for fields. 

Examplel6^.1. [Q, +, -IJK, +, ■], and [C, +, •] are aU fields. 
Reminder: Since every field is a ring, all facts and concepts that are true for rings are true for any field. 

Theorem 16.2.1. Every field is an integral domain. 
Of course the converse of Theorem 16.2.1 is not true. Consider [Z, +, •]. 

Theorem 1622. Every finite integral domain is afield. 

Theorem 1623. Ifp is a prime, then is afield. 
Theorem 16.2.3 is immediate from Theorem 16.2.2. 

Theorem 16.2.1 reminds us that the cancellation laws must be true for any field. Theorem 16.2.3 gives us a large number of finite fields, but we 
must be cautious. This theorem does not teU us that all finite fields are of the form Zp ,p a prime. To see this, let's try to construct a field of 

order 4. 

Example 16.2.2: a field of order 4. First the field must contain the additive and multiplicative identities, and 1, so, without loss of 
generality, we can assume that the field we are looking for is of the form F = {0, \, a, h]. Since there are only two nonisomorphic groups 
of order 4, we have only two choices for the group table for [F, +J. If the additive group is isomorphic to Z4 then two of the nonzero 
elements of F would not be their own additive inverse (as are 1 and 3 in Z4). Lets assume /? e F is one of those elements and yS + /? = y + 0. 
An isomorphism between the additive groups F and Z4 would require that y in F correspond with 2 in Z4. We could continue our 
argument and infer that 7-7 = 0, producing a zero divisor, which we need to avoid if F is to be a field. We leave the remainder of the 
argument to the reader. We can thus complete the addition table so that [F, +] is isomorphic to Z2^: 



+ 





1 


a 


b 








1 


a 


b 


1 


1 





b 


a 


a 


a 


b 





1 


b 


b 


a 


1 






Next, by Theorem 16.1.2, Part 1, and since 1 is the unity of F, the table for multipUcation must look hke: 








1 


a 


b 

















1 





1 


a 


b 


a 





a 






b 





b 







Hence, to complete the table, we have only four entries to find, and, since F must be commutative, this reduces our task to filling in three 
entries. Next, each nonzero element of F must have a unique multiplicative inverse. The inverse of a must be either a itself or b. If a"' = a, 
theni-i =^). (Why?)But 

a"' = a a ■ a = 1. And if a • a = 1, then a ■ b is equal to a or ft. In either case, by the cancellation law, we obtain a = 1 or = 1, 
which is impossible. Therefore we are forced to conclude that a"' = b and = a. To determine the final two products of the table, simply 
note that, a ■ a + a because the equation = x has only two solutions, and 1 in any field. We also know that a- a caimot be 1 because a 
doesn't invert itself and caimot be because a can't be a zero divisor. This leaves us with one possible conclusion, that a ■ a = b and 
similarly b ■ b = a. Hence, our multipUcation table for F is: 








1 


a 


b 

















1 





1 


a 


b 


a 





a 


b 


1 


b 





b 


1 


a 



The table Usting the multiplicative inverse of each nonzero element is: 
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u 




1 


1 


a 


b 


b 


a 



We leave it to the reader to convince him- or herself, if it is not already clear, that [F, +, •]. as described above, is a field. Hence, we have 
produced a field of order 4 and 4 is not a prime. 

This construction would be difficult to repeat for larger fields. In section 16.4 we will introduce a different approach to constructing fields that 
will be far more efficient. 

Even though not all finite fields are isomorphic to Zp, for some prime p it can be shown that every field F must have either: 

(1) a subfield isomorphic to Zp for some prime p, or 

(2) a subfield isomorphic to Q. 

In particular, if F is a finite field, a subfield of F must exist that is isomorphic to Zp. One can think of all fields as being constructed from either 
ZpOrQ. 

Example 16.2.3. [R, +, ■ ] is a field, and it contains a subfield isomorphic to [Q, +, •], namely Q itself. 

Example 16.2.4. The field F that we constructed in Example 16.2.2 should have a subfield isomorphic to Zp for some prime p. From 

the tables, we note that the subset |0, IjofjO, 1, a, 6) under the given operations of F behaves exactly like [Z2, +2, X2]. Hence, the field 
in Example 16.2.2 has a subfield isomorphic to Z2. Does it have a subfield isomorphic to a larger field, say Z3? We claim not and leave this 
investigation to the reader (see Exercise 3 of this section). 

We close this section with a brief discussion of isomorphic fields. Again, since a field is a ring, the definition of isomorphism of fields is the 
same as that of rings. It can be shown that if/is a field isomorphism, then /(a"' ) = /(a)"'; that is, inverses are mapped onto inverses under 
any field isomorphism. A major question to try to solve is: How many different non-isomorphic finite fields are there of any given order? If p is 
a prime, it seems clear from our discussions that all fields of order p are isomoi'phic to Zp. But how many nonisomorphic fields are there, if 
any, of order 4,6,8, 9, etc? The answer is given in the following theorem, whose proof is beyond the scope of this text. 
Theorem 16.2.4. 

(1) Any finite field F has order p" for a prime p and a positive integer n. 

(2) For any prime p and any positive integer n there is afield of order p" . 

(3) Any two fields of order p" are isomorphic. This field of order p" is frequently referred to as the Galois field of order p" and it is 
designated by GF(p"). 

Evariste Galois (181 1-32) was a pioneer in the field of abstract algebra. 




A French stamp honoring Evariste Galois (1811 -32) 

Theorem 16.2.4 tells us that there is a field of order 2^= 4 , and there is only one such field up to isomorphism. That is, all such fields of order 
4 are isomorphic to F, which we constructed in Example 16.2.2. 

EXERCISES FOR SECTION 16.2 
A Exercises 

1 . Write out the addition, multiplication, and "inverse" tables for each of the following fields'. 
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(a) [Z2, +2, X2] 

(b) [Z3, +3 , X3] 

(C) [Z5, +5 , X5] 

2. Show that the set of units of the fields in Exercise 1 fomi a group under the operation of the multiplication of the given field. Recall that a 
unit is an element which has a multiplicative inverse. 

3. Complete the argimient in Example 16.2.2 to show that if [F, +] is isomorphic to Z4, then F would have a zero divisor. 

4. Write out the operation tables for Z2^. Is Z2^ a ring? An integral domain? A field? Explain. 

5 . Determine all values x from the given field that satisfy the given equation: 

(a) X + 1 = -I over Z2 , Z3 and Z5 

(b) 2 ;c + 1 = 2 over Z3 and Z5 

(c) 3 ;c + 1 = 2 over Zj 

6. (a) Prove that if p and q are prime, then Z^ x Z^, is never a field. 

(b) Can Zp" be a field for any prime p and any positive integer n > 21 

7. The following are equations over Z2 . Their coefficients come solely from Z2 . Determine all solutions over Z2 ; that is, find all numbers in 
Z2 that satisfy the equations: 

(a) x^ + X = Q 

(b) + 1 = 

(c) x^+x^ + x+ l= Qi 

(d) :^ + X + \ = 

8. Determine the number of different fields, if any, of all orders 2 through 15. Wherever possible, describe these fields via a known field. 



B Exercise 
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1 16.3 Polynomial Rings 

In the previous sections we examined the solutions of a few equations over different rings and fields. To solve the equation - 2 = over the 
field of the real numbers means to find all solutions of this equation that are in this particular field K. This statement can be replaced as follows: 
Determine all a e R such that the polynomial f (x) = - 2 is equal to zero when evaluated at x = a. In this section, we will concentrate on 
the theory of polynomials. We will develop concepts using the general setting of polynomials over rings since results proven over rings are true 
for fields (and integral domains). The reader should keep in mind that in most cases we are just formalizing concepts that he or she learned in 
high school algebra over the field of reals. 

Definition: Polynomial over R. Let [R, +, ■]bearing. Apolynomial, f (x), over R is an expression of the form 

n 

fix) = YjOiX' =ao + aix +a2X? + ■■■ +a„xf , n >0, 

i=0 

where ag, aj a„ e R. If a„ ^ 0, then the degree of f (x) is n, Iff{x) = 0, then the degree off^x) is undefined and we assign the 

value -00 to the degree. If the degree off(x) is n, we write degf{x) = n. 

Comments: 

(1) The symbol x is an object called an indeterminate, which is not an element of the ring if. 

(2) The set of all polynomials in the indeterminate x with coefficients in i? is denoted by R[x] . 

(3) Note that R c i?[x]. The elements of if are called constant polynomials, with the nonzero elements of R being the polynomials of degree 0. 

(4) R is called the ground ring for R[x] . 

(5) In the definition above, we have written the terms in increasing degree starting with the constant. The ordering of terms can be reversed 
without changing the polynomial. For example, 1 + 2 x - 3 and -3 x"* + 2 x + 1 are the same polynomial. 

(6) A term of the form x* in a polynomial is understood to be 1 x*. 

Example 163.1. /(x) = 3, g(x) = 2 - 4x +7x^ , and h^x) = 2 + x'* are aU polynomials in Z[x]. Their degrees are 0, 2, and 4, 
respectively. 

Addition and multiplication of polynomials are performed as in high school algebra. However, we must do our computations in the ground ring 
over which we are considering the polynomials. 

Example 16.3.2. In Z3M, if f (x) = I + x and g(x) = 2 + x, then 

fix) + g{x) = (1+x) + (2 + x) 

= (1 +3 2)+ (1 +3 l)x 
= + 2x 
= 2x 

and 

/(x)g(x) = (l+x)-(2 +x) 

= (1 +x) • 2 + (1 +x) -x 
= IX32 + 2X + Ix + x-x 
= 2 + (2 +3 1) X + x^ 
= 2 + x^ 

However, for the same polynomials as above, / (x) and g (x) in Z[x], we have 

/(x) + g(x) = (1+x) + (2 + x) 

= (1 +2) + (1 + l)x 
= 3 + 2x 

and 

/(x)g(x) = (l+x)-(2 +x) 

= (1 +x) • 2 + (1 +x) -x 
= l-2 + 2x+ lx + x-x 
= 2 + (2 + l)x + x^ 
= 2 + 3 X + x^ 

The important fact to keep in mind is that addition and multiplication in R[x] depends on addition and multiplication in if. The x's merely serve 
the purpose of "place holders." All computations are done over the given ring. We summarize in the following theorem: 

Theorem 163.1. Let [R, +, ■] be a ring. Then: 
(I) R[x] is a ring under the operations of polynomial addition and multiplication, which depend on (are induced by) the operations in if. 
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(2) IfR is a commutative ring, then R[x] is a commutative ring. 

(3) IfR is a ring with unity, 1, then R[x] is a ring with unity (the unity in R[x] is I + x + Ox^ + ■■■). 

(4) IfR is an integral domain, then R[x] is an integral domain. 

(5) IfF is afield, then F[x] is not afield. However, F[x] is an integral domain. 

The proofs for Parts 1 through 4 are not difficult but rather long, so we omit them. For those inclined to prove them, we include the formal 
definitions of addition and multiplication in R[x] below. 

Proof Of Part 5: F[x] is not a field since for x e F[x], x"' = 1/x $ F[x]. Hence not all nonzero elements in F[x] have multiplicative inverses 
in F[x]. Every field F is an integral domain. By Part 4, F[x] is an integral domain. ■ 

Definition: Addition in R[x] . Let f(x) = ag + a\ x + a2X^ + • • • + a„, x"" and g(x) =bo + bi x + b2 x^ + ■■■ +b„x!^ be elements in R[x] 
so that a; E R and bj e R for all i. Let k be the maximum of m and n. Then 

f (x) + g{x) = Cq + Cl X + C2 X^ + ■■■ + Cj^X* 

where C; = O; + bifor i = 0, 1 , 2, . . . , k. 

Definition: Multiplication in R[x] . Let f (x) and g (x) be as above. Then 

f (x) ■ g{x) = do + di X + d2 x^ + ■■■ +dpxf where 
p = m + n, and 

s 

ds = ai b,-i 

= agbs +ai b^_i + 02 + ■■■ + ct^-i bj + a, bg 

for < s < p. 

Example 16.3.3. Let / (x) = 2 + x-andg(x) = -1 + 4x + 3 x^. We will compute / (x) • g (x) in Z[x]. Of course this product can be 
obtained by the usual methods of high school algebra. We will, for illustrative purposes, use the above definition. Using the notation of the 
above definition, ao = 2, a\ = 0, 02 = I, bo = - I, bi = 4, and ^2 = 3. We want to compute the coefficients do, d\, d2,dj,, and . We will 
compute dT, , the coefficient of the x^ term of the product, and leave the remainder to the reader (see Exercise 2 of this section). Since the 
degrees of both factors is 2, o; = bj= for / > 3. 

d^ = 00^3 + a\ b2 + ^2 ^1 + 03 ho 
= 2-0 + 0-3 + l-4 + 0-(-l)=4 

From high school algebra we all learned the standard procedure for dividing a polynomial / (x) by a second polynomial g (x). This process of 
polynomial long division is referred to as the division property for polynomials. Under this scheme we continue to divide until the result is a 
quotient q (x) and a remainder r (x) whose degree is strictly less than that of the divisor g (x). This property is valid over any field. 

Example 16.3.4. Let / (x) = 1 + x + x' and g (x) = 1 + x be two polynomials in Z2[x]. Let us divide / (x) by g (x). Keep in mind that 
we are in Z2W and that, in particular, -1 = 1 in Z2 . This is a case where reordering the terms in decreasing degree is preferred. 



+ 1 ) + Ox^ + X +1 



+ x^ 



Therefore, 



x^ + JC + 1 

X^ + X 

1 



2 1 

X^ + X + 

.v+ 1 



or equivalently, 

X-' + X + 2 = ( x^ + x) • (x + 1) + 1 

That is /(x) = g{x)-q(x) + r(x) where q{x) = x^ + x and r{x) = 1. Notice that deg()-(x)) = 0, which is strictly less than the 
deg(g(x)) = 1. 

Example 16.3.5. Let /(x) = 1 +x'*andg(x) = 1 + x be polynomials in Z2[x]. Let us divide f(x) by g(x): 
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+ +x +1 

jc + 1 ) X* + Oi' + Oi^ + Ox + 1 



+ 1 
+ 1 

+ x 

X +1 
X + 1 



Thus x" + 1 = (x^^ + + X + 1) (x + 1) . 

Since we have as a remainder, x + 1 must be a factor of x"* + 1, as in high school algebra. Also, since x + 1 is a factor of x* + 1, 1 is a 
zero (or root) of X* + 1. Of course we could have determined that 1 is a root of /(x) simply by computing /(I) =1"* +2 1 = 1 +2 1 = 0. 

Before we summarize the main results suggested by the previous examples, we should probably consider what could have happened if we had 
performed divisions of polynomials in the ring Z[x] rather than over the field Z2 . For example, /(x) = x^ - 1 and g (x) = 2x - 2 are both 
elements of the ring Z[x], yet 

1 1 1 

= -X + - 

2.V-1 2 2 

The quotient is not a polynomial over Z but a polynomial over the field Q. For this reason it would be wise to describe all results over a field 
F rather than over an arbitrary ring R. 

Theorem 1632. Division Property for F\x\. Let [F, +, •] be afield and let f (x) and g (x) be two elements o/F[x] with g(x) + 0. Then 
there exist unique polynomials q (x) and r (x) in F[x] such that / (x) = g (x) q{x) + r (x), where deg r(x) < deg g(x) . 

Theorem 16.3.2 can be proven by induction on deg / (x). 

Theorem 1633. Let [F, +, ■] be afield. An element a e F is a zero of f (x) e F[x] if and only if x - a is a factor of f (x) in F[x]. 

Proof: (^) Assume that a e F is a zero of /(x) e F[x]. We wish to show that x - a is a factor of /(x). To do so, apply the division 
property to fix) and g(x) = x - a. Hence, there exist unique polynomials q{x) and r{x) from F[x] such that /(x) = (x - a)-q(x) + r{x) 
and the degr(x) < deg(x - a) = 1, so r(x) = c e F, that is, r(x) is a constant. Also a is a zero of /(x) mean f(a) = 0. So 
fix) = (x - a) - q(x) + c becomes = / (a) = (a - a)q(a) + c. Hence c = 0, so /(x) = {x - a) ■ q(x), and x - a is a factor of /(x). 
The reader should note that a critical point of the proof of this half of the theorem was the part of the division property that stated that 
degr(x) < degg(x). 

(<=) We leave this half to the reader, exercise 6. ■ 

Theorem 163.4. A nonzero polynomial f (x) e F[x] of degree n can have at most n zeros. 

Proof: Let a e F be a zero of /(x). Then fix) = {x - a) ■ q\{x), qi(x) eF[x], by Theorem 16.3.3. If & e F is a zero of qi(x), then again by 
Theorem 16.3.3, / (x) = (x - a) (x - b) qjix), qiix) e F[x]. Continue this process, which must terminate in at most n steps since the degree of 
qiiix) would hs n - k. M 

From Theorem 16.3.3 we can obtain yet another insight into the problems associated with solving polynomial equations; that is, finding the 
zeros of a polynomial. The theorem states that an element a e F is a zero of / (x) e F[x] if and only if x - a is a factor of / (x). The initial 
important idea here is that the zero a is from the ground field F. Second, a is a zero only if (x - a) is a factor of / (x) in F[x] —that is, only 
when / (x) can be factored (or reduced) to the product of (x - a) times some other polynomial in F[x] . 

Example 16.3.6. Consider the polynomial /(x) = x^ - 2 taken as being in Q[x]. From high school algebra we know that /(x) has two 
zeros (or roots), namely ± V2~, and x^ - 2 can be factored (reduced) as (x - V2")(x + V2"). However, we are working in Q[x], these two 

factors are not in the set of polynomials over the rational numbers, Q since V2~ ^ Q . Therefore, x^ - 2 does not have a zero in Q since it 
cannot be factored over Q. When this happens, we say that the polynomial is irreducible over Q. 

The problem of factoring polynomials is tied hand-in-hand with that of the reducibility of polynomials. We give a precise definition of this 
concept. 

Definition: Irreducible over F. Let[F, +, ■] be afield and let f(x) e F[x] be a nonconstant polynomial, f(x)) is irreducible over F if and 
only if f(x) cannot be expressed as a product of two (or more) polynomials, both from F[x] and both of degree lower than that off(x). 

A polynomial is reducible over F if it is not irreducible over F. 

Example 16.3.7. The polynomial / (x) = x* + 1 of Example 16.3.5 is reducible over Z2 since x'* + I = (x + l){x^ + x- + x - 1). 

Example 16.3.8. Is the polynomial f (x) = x^ + x + I of Example 16.3.4 reducible over Z2 ? From Example 16.3.4 we know that 
X 4- 1 is not a factor of x-* + x + 1 , and from high school algebra we realize that a cubic (also second-degree) polynomial is reducible if and 
only if it has a linear (first-degree) factor. (Why?) Does /(x) = x-' -1- x -1- 1 have any other linear factors? Theorem 16.3.1 gives us a quick 
way of determine this since x - a is a factor of x^ + x + 1 over Z2 if and only if a e Z2 is a zero of x^ -1- x 4- 1 . So x-" -1- x 4- 1 is 
reducible over Z2 if and only if it has a zero in Z2 . Since Z2 has only two elements, and 1 , this is easy enough to check. 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-ShareAlike 3.0 United States License. 



Chapter 16 - An Introduction to Rings and Fields 



401 



/(O) = 0^ +2 0+2 1 = 1 and 
/(I) = 13 +2 1 +2 1 = 1 
so neither nor 1 is a zero of /(x) over Z2 . Hence, + x + 1 is irreducible over Z2. 

From high school algebra we know that x^ + x + I has three zeros from some field. Can we find this field? To be more precise, can we 
construct (find) the field which contains Z2 and all zeros of + x + 1? We wiU consider this task in the next section. 

We close this section with a final analogy. Prime numbers play an important role in mathematics. The concept of irreducible polynomials (over 
a field) is analogous to that of a prime number. Just think of the definition of a prime number. A useful fact concerning primes is: If p is a prime 
and if p\ab, then p \ aor p \ b. We leave it to the reader to think about the veracity of the following: If p (x) is an irreducible polynomial over 
F, a(x), b(_x) e F[x] and p(x)\a (x) b (x), then p(_x)\a (x) or p(x)\b (x). 

EXERCISES FOR SECTION 16.3 
A Exercises 

1 . Let / (x) = 1 + X and g(x) = \ + x + . Compute the following sums and products in the indicated rings. 

(a) / (x) + g (x) and / (x) • g (x) in Z[xJ 

(b) / (x) + g (x) and ,/■ (x) ■ g (x) in Z2[x] 

(c) (/(x)-g(x))-/(x) inZW 

(d) (/(x)-g(x))-/(x)inZ2[x] 

(e) /(x) -/(x) + /(x)-g(x)inZ2[x] 

2. Complete Example 16.3.3. 

3. Prove that: 

(a) The ring IR is a subring of the ring R[x] . 

(b) The ring Z[x] is a subring of the Q[x]. 

(c) The ring Q[x] is a subring of the ring IR[x]. 

4. (a) Find all zeros of x"* + 1 in Z3 . (b) Find all zeros of x' + 1 in Z5 . 

5. Determine which of the following are reducible over Z2 . Explain. 

(a) /(X) = x3 + 1 

(b) g(x) = x' + + X. 

(c) h(x) = x^ + x^ + 1. 

(d) k{x) = x" + x^ + 1 . (Be careful.) 

6 . Prove the second half of Theorem 16.3.3. 

7. Give an example of the contention made in the last paragraph of this section. 

8. Determine all zeros of x'' + 3 x^ + 2 x + 4 in Z5[x] 

9. Show that x^ - 3 is irreducible over Q but reducible over the field of real numbers. 
B Exercises 

10. The definition of a vector space given in Chapter 13 holds over any field F, not just over the field of real numbers, where the elements of F 
are called scalars. 

(a) Show that F[x] is a vector space over F . 

(b) Find a basis for F[x] over F. 

(c) What is the dimension of F[x] over F? 

1 1 . Prove Theorem 16.3.2. 

(a) Show that the field R of real numbers is a vector space over R . Find a basis for this vector space. What is dim R over R? 
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(b) Repeat part a for an arbitrary field F. 

(c) Show that R is a vector space over Q. 
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1 16.4 Field Extensions 



From high school algebra we realize that to solve a polynomial equation means to find its roots (or, equivalently, to find the zeros of the 
polynomials). From Example 16.3.5 of the previous section we know that the zeros may not lie in the given ground field. Hence, to solve a 
polynomial really involves two steps; first, find the zeros, and second, find the field in which the zeros lie. For economy's sake we would like 
this field to be the smallest field that contains all the zeros of the given polynomial. To illustrate this concept, let us reconsider Example 16.3.5. 

Example 16.4.1. Let f{x) = - 2 e Q[x] . It is important to remember that we are considering -2 over Q, no other field. We would 

like to find all zeros of f(x) and the smallest field, call it S for now, that contains them. The zeros are x = ±V2", neither of which is an 
element of Q. The set S we are looking for must satisfy the conditions: 

(1) 5 be a field. 

(2) S must contain Q as a subfield, 

(3) 5 must contain all zeros of f{x) = x^ - 2, and 

By condition (3), must be an element of S, and, if S is to be a field, the sum, product, difference, and quotient of elements in S must be 
in S. So V2", (V2~) , {^) ' ■■■ ' ^ + ^ ' ^ ~ ^ ' ^ / V2" must all be elements of S. Further, since S contains Q as a 

subset, any element of Q combined with V2" under any field operation must be an element of S. Hence, every element of the form 

a + , where a and h can be any elements in Q, is an element of S. We leave to the reader to show that 5 is a field (see Exercise 1 of this 

section). We note that the second zero of x^ - 2, namely - V2~, is an element of S. To see this, simply take a = and b = -\. The field 5 
is frequently denoted as Q^V2"j, and it is referred to as an extension field of Q. Note that the polynomial - 2 = |x - V2"j [x + V2"j 

factors into linear factors, or splits, in Q^V2^jW; that is, all coefficients of both factors are elements of the field Q^V2^j. 

Example 164.2. Consider the polynomial g(x) = x^ + x + 1 e Z2M. Let's repeat the previous example for g(x) over Z2. First, 
g (0) = 1 and g (/) = 1, so none of the elements of Z2 are zeros of g (x). Hence, the zeros of g (x) must he in an extension field of Z2. By 
Theorem 16.3.3, g (x) = x^ + x + 1 can have at most two zeros. Let a be a zero of g (x). Then the extension field S of Z2 must contain 
a -a = ,0" ,a + a, a + 1, and so on. But, since g (a) = 0, we have + a + 1 = 0, or, equivalently, = -(a + 1) = a + 1 (remember, 
we are working in an extension of Z2). Note the recurrence relation we can use to reduce powers of a. 

So far our extension field 5 of Z2 is the set {0, 1, a, a + 1}. For S to be a field, all possible sums, products, differences, and quotients of 
elements in 5 must be in 5. Let's try a few: 

a + a = aiy +2 1) = a-0 = eS 

Since a + a = 0, -a = a, which is in S. Adding three a's together doesn't give us anything new: a + a + a = aeS In fact, na is in S for 
all possible positive integers n. Next, 

3 2 
a = a ■ a 

= (a + l)-a 

= + a 

= (a + 1) + a 

= 1 eS 



Therefore, a ' = a + 1 and (a + 1) ' = a. 

It is not difficult to see that a" is in S for all positive n. Does S contain all zeros of x^ + x + 1? Remember, g{x) can have at most two 
distinct zeros and we called one of them a, so if there is a second, it must be a + 1 . To see if a + 1 is indeed a zero of g (x), simply compute 

f(a+ 1): 

f(a + l)=(a+lf + (a+l) + l 

= a^+l+a + l+l 
= 0^ + + 1 
= 

Therefore, a + 1 is also a zero of x^ + x + 1 . Hence, 5 = {0, 1, a, a + 1) is the smallest field that contains Z2 = {0, 1} as a subfield and 
all zeros of x^ + x + 1. This extension field is denoted by Z2(a). Note that x^ + x + 1 spUts in Z2(a); that is, it factors into linear factors in 
Z2(a). We also observe that Z2(a) is a field containing exactly four elements. By Theorem 16.2.4, we expected that Z2(a) would be of order 
for some prime p and positive integer n. Also recall that all fields of order p" are isomorphic. Hence, we have described all fields of order 
2^ = 4 by finding the extension field of a polynomial that is irreducible over Z2. 

The reader might feel somewhat uncomfortable with the results obtained in Example 16.4.2. In particular, what is a? Can we describe it through 

a known quantity? All we know about a is that it is a zero of g (x) and that = a + 1. We could also say that a (a + 1) = 1, but we really 

expected more. However, should we expect more? In Example 16.4.1, V2~ is a number we are more comfortable with, but all we really know 
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about it is that a = is the number such that = 2. Similarly, the zero that the reader will obtain in Exercise 2 of this section is the 
imaginary number i. Here again, this is simply a symbol, and all we know about it is that f = -I. Hence, the result obtained in Example 16.4.2 
is not really that strange. 

The reader should be aware that we have just scratched the surface in the development of topics in polynomial rings. One area of significant 
applications is in coding theory. 

EXERCISES FOR SECTION 16.4 
A Exercises 

1 . (a) Use the definition of a field to show that Q( V2") is a field. 

(b) Use the definition of vector space to show that q| V2"j is a vector space over Q. 

(c) Prove that |l, V2"| is a basis for the vector space Q|V2"j over Q, and, therefore, the dimension of Q(V2") over Q is 2. 

2. (a) Determine the splitting field of / (a) = x- + 1 over R . This means consider the polynomial / (jr) = + 1 e R [a:] and find the smallest 
field that contains R and all the zeros of f(x). Denote this field by R(i). 

(b) R(i) is more commonly referred to by a different name. What is it? 

(c) Show that { 1 , i} is a basis for the vector space R(i) over R . What is the dimension of this vector space (over R)? 

3. Determine the splitting field of j:'' - 5x^ + 6 over Q. 

4. (a) Factor j;^ + x + 1 into linear factors in Z2(a). 

(b) Write out the field tables for the field Z2(a) and compare the results to the tables of Example 16.2.2. 

(c) Cite a theorem and use it to show why the results of part b were to be expected. 

5. (a) Show that x^ + x + 1 is irreducible over Z2. 

(b) Determine the splitting field of x^ + x + 1 over Z2. 

(c) Use Theorem 16.2.4 to illustrate that you have described all fields of order 2^ . 

6. (a) List all polynomials of degree 1, 2, 3, and 4 over Z2 = GF(2). 

(b) Use your results in part a and list all irreducible polynomials of degree 1, 2, 3, and 4. 

(c) Determine the splitting fields of each of the polynomials in part b. 

(d) What is the order of each of the spUtting fields obtained in part c? Explain your results using Theorem 16.2.4. 

I 16.5 Power Series 

In Section 16.3 we found that a polynomial of degree n over a ring if is an expression of the form 

n 

fix) = YjO-i^ = o-o + aix +02:)^ + ■■■ +a„y^ , n >0, 

i=0 

where each of the a,- are elements of if and a„ 0. In Section 8.5 we defined a generating function of a sequence s with terms so, si, S2, ... as 
the infinite sum 

G(i, z) = 'ZsiZ' = so + siz + S2Z^ + ■■■ 

1=0 

The main difference between these two expressions, disregarding notation, is that the latter is an infinite expression and the former is a finite 
expression. In this section we will extend the algebra of polynomials to the algebra of infinite expressions like G (s, z), which are called power 
series. 

Definition: Power Series. Let [if; +, •] be a ring. A power series over if is an expression of the form 

CO 

fix) = 2 fl; = flo + aix +a2X^ + ■■■ 

i=0 

where a/, 02, a^, ... e R. The set of all such expressions is denoted by R[[x]]. 

Our first observation in our comparison of R[x] and if[M] is that every polynomial is a power series and so R[x] c if[[j:]]. This is true because 
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a polynomial oq + x + a2 + ■■■ + a„x" of degree n in R[x], can be thought of as an infinite expression where a; = for > n. In addition, 
we will see that R[[x]] is a ring with subring R[x]. 

R[[x]] is given a ring structure by defining addition and multipUcation on power series as we did in R[x], with the modification that, since we 
are dealing with infinite expressions, the sums and products will remain infinite expressions that we can determine term by term, as was done in 
Section 16.3. 

Definition: Power Series Addition and Multiplication. Given power series 
/W = Yi^^i^ = (^0 + aix +a2X^ + ■■■ 

i=0 

and 



g{x) = Yibii^ =bQ + bi X + b2 x^ + 

i=0 



their sum is 



f(x) +g(x) = Z(ai +bi)x' 

i=0 

and their product is 

fix) -gix) = f; di x' 

i=0 

where 

I 

di = Y^ajhi^j 

Let's look at an example. 

Example 16 J.l. (Example 8.5.3, Revisited.) Let 

fix) = Xix' =0 + lx+2x^ +3x^ + ■■■ 

i=0 

and 

gix) = Z2'V =1 +2x +4x? +8^ + ■■■ 

i=0 

be elements in Z[[j;]]. Let us compute / (j:) + gix) and f ix)- gix). First the sum: 

00 00 oo 

/ ix) + gix) = 2 ! + Z 2' ;c' = Z 0' + 2') x^ 

i=0 i=0 i=0 

= 1 +3;c + 6;i:^ + ll;!:^ + ••• 
The product is a bit more involved: 



x)-gix) =(E'x'j[E2'x-) 



where 



For example. 



Hence, 



:(0 + lx + 2x^ + 3x^ + •••)(! +2x + 4x^ + ?.x^ + ■■■) 
= 0-1 + (0-2 + M);<: + (0-4 + l-2 + 2-l);c^+ ••• 

CO 

= Ldix' 

i=0 



j=0 j=0 



d3=0-2^+ 1 -2^+ 2-2i+3-2° 
=0+4+4+3 
= 11 
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f(x)-g(x) = x + 4x^ + ll}^ + ••• 
The First few terms of the product do not suggest a pattern but with Mathematica, we can get a closed form expression for the coefficients. 

i 

Simplify[2y2'-^] 
j=o 

-;■ + 2'+' - 2 
Therefore, d,- = 2'+^ - ; - 2 and 

= E(2'+' -i-2)x' 

i=0 

We have shown that addition and multipMcation in is virtually identical to that in R[x\. The following theorem parallels Theorem 16.3.1, 

establishing the ring properties of if[M]. 

Theorem 16J.1. Let [R, +, •] be a ring. Then: 

(1) R[[x]] is a ring under the operations of power series addition and multiplication, which depend on (are induced by) the operations in R. 

(2) IfR is a commutative ring, then R[[x]] is a commutative ring. 

(3) IfR is a ring with unity, 1, then R[[x]] is a ring with unity (the unity in R[x] isl+0x + 0x^+---). 

(4) IfR is an integral domain, then R\{x\\ is an integral domain. 

(5) IfF is afield, then F[[x\\ is not a field. However, F[[xJJ is an integral domain. 

We are most interested in the situation when the set of coefficients is a field. Theorem 16.5.1 indicates that when F is a field, F[[xJ] is an 
integral domain. A reason that F[[x]] is not a field is the same as one that we can cite for F[x], namely that x does not have multiplicative 
inverse in With all of these similarities, one might wonder it the rings of polynomials and power series over a field are isomorphic. It 

turns out that they are not. 

The difference between F[x\ and become apparent when one studies which elements are units (i.e., elements that have multiplicative 

inverses) in each. First we prove that the only units in F[x\ are the nonzero constants— that is, the nonzero elements of F. 

Theorem 16 J J. Let [F; +, •] be afield, f(x) is a unit in F[x] if and onlyf(x) is a nonzero element ofF. 

Proof: (=>) Let fix) be a unit in F[x]. Then f (x) has a multiplicative inverse, call it g{x), such that f (x) ■ g(x) = 1. Hence, the 
deg (/■ (x) • g (x)) = deg(l) = 0. But deg (/' (x) • g (x)) = deg/(x) + degg(x). So deg/(x) + degg(x) = 0, and since the degree of a 
polynomial is always noimegative, this can only happen when the deg/(jr) = d&gg(x) = 0. Hence, f{x) is a constant, an element of F, 
which is a unit if and only if it is nonzero. 

(<=) If f(x) is a nonzero element of F, then it is a unit since F is a field. Thus it has an inverse, which is also in F[x] and so f(x) is a unit of 
F[x].m 

Before we proceed to categorize the imits in F[[x]], we remind the reader that two power series co + uix + a2X^ + ■■■ and 
bo + b\x + b2X^ + • • • are equal if and only if corresponding coefficients are equal, a; = bi for all ;> 0. 

Theorem 16S3. Let [F; +, •] be afield. Then f{x) = 2 a, ^ is a unit o/F[[j;]] if and only ifag i= 0. 
Proof: (=>) If fix) is a unit of F[[x]] , then there exists gix) = J^bixf in F[[x]] such that 

!=0 

fix) -gix) = (ao + aix + a2^ + •••)-ibo + bix + b2}^ + ■■■) 
= 1 

= 1 + 0;<: + 0;<:^+ ••• 

Since corresponding coefficients in the equation above must be equal, 00-^0 = 1, which implies that ao =#= 0. 

00 

(«=) Assume that ao # 0. To prove that fix) is a unit of F[[j:]] we need to find gix) = Yjbixf inF[[j:]] such that 

!=0 

f(.x)-gix) = Y.dix'= 1. 
1=0 

If we use the formula for the coefficients rfo.'^i.'^z , ... of fix) -gWandequatecoefficients, we will obtain 
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di = Gobi + ai bo =0 
d2 = ao b2 +ai b\ + 02 bo 

ds = ao bs + fli bs-\ + ■■■ + a^bo 

Therefore, the existence of g (x) is equivalent to the existence of a solution bo,b\,b2, .... to the above system of equations. Since ao * 0, we 
can solve the first equation for bo- Then we can continue to the second equation and solve for fci, then 62 can be found by solving the third 
equation, etc. Hence, 

bo = ao"' 

bi = -ao"'(ai bo) 

bi = -ao"'(ai bi + a2bo) 

bs = -ao"'(ai Vi + 02^2 + ••• +asbo) 

CO 

Therefore the powers series 2 bi x' is an expression whose coefficients lie in F and that satisfies the statement f(x) ■ g(x) = 1 . Hence, g (x) is 

(=0 

the multiplicative inverse of f(x) and f(x) is a unit.. 
Example 16.5^. Let 

f(x) =1 + 2x + 3x^ + 4x^ + ■■■ 

= E(/+ 

;=0 

be an element of Q[[x]]. Then, by Theorem 16.5.3, since ao = 1 + 0, / (x) is a unit and has an inverse, call it g (x). To compute g (x), we follow 
the procedure outlined in Theorem 16.5.3 . Using the formulas for the fe, 's, we obtain 

bo = 1 

bi = -1(2-1) = -2 

b2 = -l(2-(-2)+ 3-1) = 1 

bi= -1(2-1 + 3-(-2) + 4-l)=0 

b4= -1(2-0 + 3-1 + 4-(-2) + 5-l) = 

bi= -l(2-0 + 3-0 + 4-(l) + 5-(-2) + 6-l)=0 

bs= -1(2-0 + 3-0 + ---(j-2)-0 + (j-l)-l+j-(-2) + (j + l)-l) = j>3 
Hence, g(x) = I - 2x is the multiplicative inverse of /(x). 

If we compare Theorems 16.5.2 and 16.5.3, we note that while the only elements in F[x] that are units are the nonzero constants of F, the imits 
in F[[x]] are every single expression in x where ao i= 0. So certainly F[[x]] contains a wider variety of units than F[x]. Yet F[[x]] is not a field, 
since x e F[[x]] is not a unit by Theorem 16.5.3. So concerning the algebraic structure of F[[x]], we know that it is an integral domain that 
contains F[x\. If we allow our power series to take on negative exponents— that is, consider expressions of the form 

fix) = 2 aix' 

where all but a finite number of terms with a negative index equal zero. These expressions are called extended power series. The set of all such 
expressions is a field, call it E. This set does contain, for example, the inverse of x namely x'^ . It can be shown that each nonzero element of E 
is a unit. 

EXERCISES FOR SECTION 16.5 
A Exercises 

l.Let f(x)= Y^di^' and g(,x)= 2^; J^' be elements of i?[[x]]. Let 

1=0 j=0 

.m-g(x) = Y.dix'= 1. 

1=0 

(a) Apply the distributive law repeatedly to 

(ao + a\ x + a2X^ + ■■■)-(bo + bi x + b2 x^ + ■■■) 
to obtain the formula 
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s 

ds= Y, O-i bs-i 

i=0 

for the coefficients of /'(x) • g{x). Hence, you have shown that 

fix) ■ gix) = ziZoi bs-i] 
j=oVi=o / 

(b) Apply the above formula to the product in Example 16.5.1 and show that the result is the same as that obtained in this example. 

2. (a) Prove that for any integral domain D, the following can be proven: 

CO 

fix) = 2 is a unit of D[[x]] if and only if ao is a unit in D. 

i=0 

(b) Compare the statement in part a to that in Theorem 16.5.3. 

(c) Give an example of the statement in part a in Z[[j;]] . 

3. Use the formula for the product to verify that the expression g(x) of Example 16.5.2 is indeed the inverse of f(x). 

00 

4. (a) Determine the inverse of / (jt) = 1+ x + :>^ + ---= "Zx" iaQ[[x]]. 

1=0 

(b) Repeat part a with / (x) taken in Z.2[[x]] . 

CO J 

(c) Use the method outlined in Chapter 8 to show that the power series fix) = "^xf is the rational generating function - — . What is the 

1=0 ' ^ 

inverse of this function? Compare your results with those in part a. 

CO 

5. (a) Determine the inverse of = inQ[[j;]]. 

1=0 

(b) Use the procedures in Chapter 8 to find a rational generating function for hix) in part a. Find the multiplicative inverse of this 
function. 

6. Leta(x) = 1 + 3x + 9^^ + 27 + ••• = 23' x" and 

(=0 

bix) = I + X + x^ + x^ + ■■■ =Y. x' bothinR[[x]]. 

(a) What are the first four terms (counting the constant term as the 0* term) of aix) + b (x)? 

(b) Find a closed form expression for a (x) . 

(c) What are the first four terms of a (x) b (x)? 

B Exercise 

7. Write as an extended power series: 

(a) (x^-x')-' 

(b) ix^-lx^+x'^)-^ 
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I SUPPLEMENTARY EXERCISES FOR CHAPTER 16 

Section 16.1 

1. (a) Expand (A + B)^ in the ring [M„x„(IR); + , •]• 

(b) Will your result be similar for any noncommutative ring? 

2. (a) Expand (A + B)Mn the ring [M„x„(IR); + , •]• 

(b) Win your result be similar for any noncommutative ring? 

3. Let D be the set of all 2 x 2 diagonal matrices over the real numbers. 

(a) Prove that D is a subring of [M2x2(IR); + , •] > hence a ring under the usual operations. 

(b) Prove that £) is a commutative ring with unity. 

(c) Is the cancellation law true in D^ 

4. (a) Use the definition of a ring to convince yourself that R = ^ + b V2^ | a, b ezj is a ring. A common name given this ring is z|^V2^j . 

(b) What is the unity of z[V2~]? 

(c) Prove that z|^V2"| is an integral domain. 

5. It can be shown, in general, that if if is any ring, [M„xnW; + , •] is a ring. 

(a) How many elements are there in the ring R = [M2x2(Z2); + , •]? What are the zero and unity of if? 

(b) Determine all solutions of the equation - 7 = in if . 

6. Find all six units of [M2x2(Z2); + , •]• Hint: The set of units is closed with respect to multipUcation and one of them is ^ J ^ j 

7. Let ^={^Q o)]''^''^} "^^^ A is a ring under matrix addition and multiplication. Prove that A is isomorphic to the ring of real numbers.. 

Section 16.2 

8. Show that Z2 is a subfield of the field given in Example 16.2.2, or equivalently, that the field in this example is an extension field of Z2. 

9. Show that a and b are the two roots of the equation j;^ + jr+l=Ointhe field of Example 16.2.2. 

10. Let A = 1^ ''^ ^ j I a, 6 e Ir|. Prove that A with matrix addition and multipUcation is isomorphic to the ring of complex numbers, C. 
Section 16.3 

1 1 . Find all rational zeros (roots) of f(x) = x"* - 6 + 10 - 6 x + 9 and factor f(x) into irreducible factors in Q[x] . 

12. Determine aU zeros of / (x) = x' + 1 in the field of Example 16.2.2, and express / (x) as a product of irreducible factors over that field. 

13. Repeat Exercise 12forg(x) = x^ + x^ + x, 

14. Find all five roots of /(x) = x^ + 7 x in Zg . Explain why this does not contradict Theorem 16.3.4. 

Exercises 15 to 20 develop an introduction to polynomial codes. In Chapter 15 we introduced group codes. Here, we will discuss another code 
that uses polynomials. A k - tuple in Z2'" can be identified with a polynomial of degree k - I in the integral domain Z2LXJ and conversely. We 
do this by associating a k-tuple with the coefficients of a polynomial starting with the constant term. For example, the 5-tuple (1,0, 1 , 1 , 0) is 

viewed as the polynomial l+Ox+lx^ + lx'+Ox'' = l + x^ + x^.Ifwe define addition and multiplication on Z2* based on polynomial 
operations, we wiU have highly structured codes. For the actual code, we present an example where k = 7. 
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15. To add ^-tuples, we can take two equivalent approaches. We can either simply add the ^-tuples coordinatewise as in any direct product, or 
we can covert the ^-tuples to polynomials of degree ^ - 1 or less, add them, and then write down the coefficients of the sum. 

(a) For each of the following pairs of add and multiply the pairs A:-tuples, where k varies, compute their sum. Use both ways to add for 
at least one part. 

(i) (0, 1, 0)and (1, 1, 1) 

(ii) (0, 1,0, l)and(l, 1,0, 1) 

(iii) (1, 1, 1,0, 1, 0, 1) and (0, 0, 0, 0, 1, 0, 0) 

(iv) (1,0, 0, 1, 1, 1, 1) and (0, 0, 0, 1, 0, 0, 0) 

(b) What what relationship between polynomials and ^-tuples makes it possible to do this addition two different ways to get the same 
sum. 

16. The encoding of a string of bits is based on polynomial division. Given a four bit message, we make the bits coefficients of a sixth degree 
polynomial, b^x^ + b^x'^ + b^x^ + b^x^ which we can also express in Z2* as (0,0,0, b^, b4, b^, b^), we divide this polynomial by 
p(_x) =1 + x + x^ and add the remainder to the "message polynomial. The quotient is in the division is discarded. Thus, if the remainder, 
which must be a polynomial of degree less than 2, is + bix + b2X^,the encoded message is the string of bits (bo, bi, 62, b^, 64, b^, b^). 

(a) Encode the following elements of Z2^as described above. 

(a) (0, 0, 0, 1, 1, 0, 1) 

(b) (0, 0, 0, 1, 1, 1, 1) 

(c) (0, 0,0,0,0, 1, 0) 

(b) Prove that the encoded message will always represent a polynomial with is evenly divisible by the polynomial pix) that is used to 
encode the message. 

17. A single bit error in the transmission of our seven bit encoded message (ho, hi, h2, b^, hn, h^, h(,) can be though of as a monomial 
expression x-i, where < j <6. The effect of an error occurring is to add that monomial to the encoded message. So if the last bit is transmit- 
ted incorrectly, the monomial x^ is added and the received bit sequence is (bo, bi, b2, b^, b^, bj, b^ +2 1). If no error takes place, we can think 
of the zero polynomial being added. Prove that if an error takes place, the received bit string represents a polynomial that is not a multiple of 
p(x). 

18. There are seven different single bit errors. Let's focus on what happens if an error occurs in the last bit. If the error occurs in the last bit 
and the received bit string represents the polynomial m(x), show that the remainder upon dividing m(x) by p(x) will be the same for all possible 
values of m(x) . What is that remainder? This is called the syndrome for an error in the last bit. 

19. What are the syndromes for each of the other error positions? Let's agree to number them 0* through 6* , so the 6* position syndrome 
was determined above. What the syndrome if no error occurs? 

20. Assuming no more than a single bit error in the transmission of seven bits, what is the transmitted bit string, given these received strings? 

(a) (0, 1,0, 1,0, 1, 1) 

(b) (1, 1, 1,0,0,0,0) 

(c) (0, 0, 1, 1, 0, 1, 0) 

Section 16.4 

21. In Exercise 5 of Section 16.4 you constructed GF(8) using x^ + x + 1. Show that GF(8) can also be obtained by using the polynomial 
g(x) = x^ + x^ + 1. 

22. (a) Show that f (x) = x"* + x + 1 is irreducible over Z2 . 

(b) Describe the splitting field of / (x) over Z2 . 

(c) Let a be a zero of / (x). Show that each nonzero element of the splitting field in part (b) can be described as a power of a. 
Section 16.5 

23. Review Example 16.5.2. Derive the multiplicative inverse of l-2x + :)^ by doing repeated polynomial division, as suggested by the 
following first step: 
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1 

2x-x^ 

24. Use polynomial long division to obtain the power series representation of j^over Q. What is the inverse of the power series you obtained? 

25. Find the generating function for the sequence defined by the difference equation = aj_i + aj_2, k>2, with cq = ai = 1 the indicated 
fields. 

(a) Q 

(b) Z2 

(c) Z3 

26. Determine the inverse of each of the power series in Exercise 25. 

27. Recall from high school algebra that any quadratic with real coefficients, of the form ax^ + bx + c =0, ai^Q, can be solved using the 
quadratic formula: 

-h±^ h^-Aac 

X = 

2 a 

(a) Does this formula always produce zeros in R? 

(b) Use the quadratic formula to solve x^ + x + 1 = in Z3 , 

(c) Use the quadratic formula to solve + 2 = in Z3 . 

(d) Use the quadratic formula to solve + x + 2 = in Z3 . 

(e) What observations do you have based on parts (b) - (d)? 
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Solutions and Hints to Odd-Numbered Exercises 

CHAPTER 1 
Section 1 .1 

l.(a) 8, 15, 22, 29 

(b) apple, pear, peach, plum These solutions are not unique. 

(c) 1/2, 1/3, 1/4, 1/5 

(d) -8, -6, -4, -2 

(e) 6, 10, 15, 21 

3.(a) (2 /t + 1 : e Z, 2 « « 39) (b) (x e Q : -1 < x < 1) 

(c) |2n:«eZ) (d) (9 « : n e Z, -2 s; «) 

5 .(a) True (b) False (e) True (d) True (e) False 

(f) True (g) False (h) True 
Section 1 .2 

l.(a)|2, 3) (b)(0, 2, 3) (c) (0, 2, 3) (d) (0, 1, 2, 3, 5, 9) 
(e)(0) (f)0 (g)j 1, 4, 5, 6, 7, 8, 9) (h) |0, 2, 3, 4, 6, 7, 8) 
(i) 0) |0) 

3. These are all true for any sets A, B, and C. 

5.(a)|l, 4) c A c (1, 2, 3, 4) 

(b) |2) c A c {1, 2, 4, 5) (c) A = {2, 4, 5) 

7. 
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? Select 

Select[/;if, crit] picks out all elements e, of list for which critlei] is True. 
Select|/«f, crit, n] picks out the first n elements for which cnf[e,] is True. » 

? PrimeQ 

Pr\meQ[expr] yields True if expr is a prime number, and yields False otherwise. » 

(b) 

Select [Range [2000, 2099], PrimeQ[tt] &] 

(2003, 2011, 2017, 2027, 2029, 2039, 2053, 2063, 2069, 2081, 2083, 2087, 2089, 2099) 

Section 1 .3 

l.(a)|(0, 2), (0, 3), (2, 2), (2, 3), (3, 2), (3, 3)) 

(b) 1(2, 0), (2, 2), (2, 3), (3, 0), (3, 2), (3, 3)) 

(c) ((0, 2, 1), (0, 2, 4), (0, 3, 1), (0, 3, 4), (2, 2, 1), (2, 2, 4), (2, 3, 1), (2, 3, 4), (3, 2, 1), (3, 2, 4), (3, 3, 1), (3, 3, 4)) 

(d) 

(e) ((0, 1), (0, 4), (2, 1), (2, 4), (3, 1), (3, 4)) 

(f) 1(2, 2), (2, 3), (3, 2), (3, 3)) 

(g) ((2, 2, 2), (2, 2, 3), (2, 3, 2), (2, 3, 3), (3, 2, 2), (3, 2, 3), (3, 3, 2), (3, 3, 3)) 

(h) 1(2, 0), (2, |2)), (2, (3|), (2, |2, 31), (3, 0), (3, |2)), (3, (3)), (3, |2, 3))) 
3.|a, b], {a, c), (a, d), {h, c), {/!), t/), (c, d] 

5. There are n singleton subsets, one for each element. 

7. (a) 1+00, +01, +10, +11, -00, -01, -10, -11) (b) 16and512 

9. When A = B 

Section 1 .4 

l.(a) 11111 (b) 100000 (c) 1010 (d) 1100100 

3. (a) 18 (b) 19 (c) 42 (d) 1264 

5. There is a bit for each power of 2, starting with the zeroth power. The number 

1990 is between 2'" = 1024 and 2" = 2048, so there ai-e 10 + 1 (the power 
of 2) bits. 

(a) 11 (b) 12 (c) 13 (d) 8 
7. It must be a multiple of four. 

Section 1 .5 

l.(a) 24 (b) 6 (c) 3, 7, 15, 31 (d) 1,4, 9, 16 

, , , 1 1 1 1/1 
3. (a) + + + ■ ■ ■ + = — 

^ ' 1(1 + 1) 2(2+1) 3(3+1) n(n+l) n+1 

,, . 1 1 1 1113 3 

(b) — -\ + — = -+ - + —= - = 

1 (2) 2(3) 3(4) 2 6 12 4 3+1 

(c) 1 + 2^ + 3^ + ••• + «3 = {'-)n^in + if 

1 +4 + 27 = 36 (j) (3)2 (3 + 1)2 = 36 

5.{x + y)" = Ci)x" + { "^)x"-'y + Ci)x"-^y^+■■■ + ( ■Li)xy"-'+( "Jy" 
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7.(a){xeQ\0<x^5} (b) 

(c) e Q I -5 < ;c < 5} = B5 (d) {;c e Q | -1 < ;«: < 1} = Bj 
9.(a) 36 (b) 105 

Supplementary Exercises— Chapter 1 

l.(a)|2, 1) (b)0 (c)(i-i) 

3.(a){0, 3,4, 5,6, 7,8,9) (b) (3, 6, 9) (c) (0,1,2) 
5. (a) A U B = {1, 2, 3, 4, 5, 6, 7, 9, 12} => |(A U B)l = 9 

|A| = 6, |B| = 5 AnB = {2,5)^|(AnB)|=2 

|A| + |B|-|(AnB)l = 6 + 5-2 = 9 

(b) 10,8,2 

(c) (A U B U C) = l((A U B) U C)l 

by part (a) = |(A U B)| + |C| - |((A U B) H Ql 
Distributive = |(A U B)l + IQ - \((A fl C) U (B fl C))| 
by part (a) = |A| + |B| - |(A fl B)| + |C| - [|(A fl C)| 

+ i(Bnc)i-i((Anc)n(Bnc))i] 

Simplily = |A| + |B| + |C| - |(A R B)l - l(A fl Ql 

-i(Bnc)i + i(AnBnc)i 

7. (a) ((4, 4)) 

(b) {(2, 4), (4, 4), (6, 4)) 

(c) {(4,4,4)) 

(d) {(4, 2), (4, 4), (4, 6)) 

(e) {(2, 4, 1), (2, 4, 5), (4, 4, 1), (4, 4, 5), (6, 4, 1), (6, 4, 5)} 

15. (a) Ao = {0), Ai = {0, 1,2,3), A2 = {0, 1,2, ...,6), A3 = {0, 1,2, ...,9) 

(b) (0, 1, 2, 3) 

(c) (0) 

(d) (0, 1,2, ...,9) 

(e) (0, 1,2, ...,9) 

17. Parts a, b, and d are true with multiplication replacing addition. 
CHAPTER 2 
Section 2.1 

1 . If there are m horses in race 1 and n horses in race 2 then there are m • n possible daily doubles. 
3.72 = 4-6-3 
5.720 = 6-5-4-3-2-l 

7. If we always include the blazer in the outfit we would have 6 outfits. If we 
consider the blazer optional then there would be 12 outfits. When we add a 
sweater we have the same type of choice. Considering the sweater optional 

produces 24 outfits. 

9. (a) 2* = 256 (b) 2* = 16. Here we are concerned only with the first four 
bits, since the last four must be the same. 

(c) 2' = 128, you have no choice in the last bit. 
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11. (a) 16 (b)30 

13. (a) 3 (b)6 

15. 18 
17. (a) 



Start 




abed 



(b) 5^ 

19.2"-' - 1 and 2" -2 
Section 2.2 

I. P(1000; 3) 

3. With repetition: 26** ~ 2.0883 x 10" 
Without repetition: P(26; 8) ~ 6.2991 > 10'° 
5. 15! 

7.(a)P(15;5) = 360360 (b) 2- 14- 13- 12- 11 =48048 
9.2-P(3;3)= 12 

I I . (a) P(4; 2) = 12 (b) Pin; 2) = n{n-l) 

(c) Case \:m>n. Since the coordinates must be different, this case is impossible. 
Case 2: m ^ n.P(n; m). 

Section 2.3 

l.{{al, {b\, (cll, {{a, b\, (cl), {{a, c\, {b\\, {{a\, [b, c\\, {{a, b, c]} 
3. No. By this definition it is possible that an element of A; might not belong to A. 
5. The first subset is all the even integers and the second is all the odd integers. 
These two sets do not intersect and they cover the integers completely. 

7. Since 17 participated in both activities, 30 of the tennis players only played tennis and 35 of the swimmers only swam. Therefore, 
17 + 30 + 35 = 82 of those who were surveyed participated in an activity and so 8 did not. 

9. Assume |(Ai U ^2)! = |A,| + IA2I - |(Ai R Aj)!. 
l(Ai UA2UA3)| = |((Ai UA2)UA3)| 

by 1st law = |(Ai U ^2)! + IA3I - |((Ai U A2) Aj)! 

Distributive = |(Ai U ^2)! + IA3I - |((Ai R A3) U (A2 fl A3))| 

1st law (twice) = |A,| + IA2I - |(Ai R Az)! + IA3I 

-[i(Ai n A3)i + i(A2 n A3)i - i((Ai n A3) n (A2 n a3))|] 

Simplify = |A,| + IA2I + IA3I - |(A, R Az)! - |(Ai R A3)| 

-i(A2nA3)i + i(AinA2nA3)i. 
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(b) l(Ai U A2 U A3 U A4)| = |A,| + IA2I + IA3I + IA4I - |(A, n A2)| 

-i(Ai n A3)i - i(Ai n A4)i - i(A2 n a3)i - i(A2 n a4)i 

-i(A3 n A4)i + i(Ai n A2 n a3)i + kai n az n a4)i 

+i(A, n A3 n A4)i + i(A2 n A3 n a4)i - kai n az n A3 n a4)i 

Derivation: 

l(Ai U A2 U A3 U A4)| = |((Ai U A2 U A3) U A4)| 

1st law = |(Ai U A2 U A3)| + IA4I - |((Ai U A2 U A3) n A4)| 

Distributive = |(Ai U A2 U A3)| + IA4I - |((A, f) A4) U (A2 D A4) 

u (A3 n A4))i 

2nd law (twice) = |Ai| + IA2I + IA3I - |(Ai fl A2)| - |(Ai R Aj)! 

-i(A2 n A3)i + i(Ai n A2 n a3)i + 1A4I - lUi n a4)i 

+ i(A2 n A4)i + i(A3 n A4)i - i((A, n A4) n (A2 n a4))i 
- i((Ai n A4) n (A3 n a4))i - i((A2 n A4) n (A3 n a4))i 
+ i((Ai n A4) n (A2 n A4) n (A3 n a4))|] 

Simplify = |A,| + IA2I + IA3I + IA4I - |(A, fl A,)! - |(A, fl Aj)\ 

-i(A2 n A3)i - i(A, n A4)i - i(A2 n a4)i - i(A3 n a4)i 
+i(Ai n A2 n A3)i + i(A, n A2 n a4)i + kai n A3 n a4)i 
+i(A2 n A3 n A4)i - i(Ai n A2 A3 n a4)i 

1 1 . Hint: Partition the set of fractions into blocks, where each block contains fractions that are numerically equivalent. Describe how you would 
determine whether two fractions belong to the same block. Redefine the rational numbers to be this partition. Each rational number is a set of 

fractions. 

Section 2.4 

I. C(10; 3)-C(25; 4) = 1, 518, 000 

3. C(10; 7) + C(10; 8) + C(10; 9) + C(10; 10) 

5. I6x*-96x^y + 2l6x^y^-2l6xy^ + Sly* 
7. (a) C(52;5) = 2, 598, 960 

(b) C(52; 5) • C(47; 5) • C(42; 5) • C(37; 5) 
9. C(4; 2) C(48; 3) 

II. C(12; 3) -0(9; 4) -0(5; 5) 
13.(a)C(10;2) = 45 (b) C(10; 3) = 120 

15 . Assume |A| = n. If we let = y = 1 in the Binomial Theorem, we obtain 
2" = C(n ; 0) + C(n; 1) + • • • + C(n ; n), and as a consequence of Example 
2.4.7 we realize that the right side of this equation says the sum of all subsets 

of A. Hence |P(A)| = 2l^l 
17. 999,400,119,992. 

Supplementary Exercises— Chapter 2 

l.(a) 10-9-8 = 720 
(b) 10 -10 -10 = 1000 
3. (a) 
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(b) If you imagine drawing a tree diagram for this general case, from the starting point, there will be m branches, one for each element of A. 

From the end of each of the "A branches" there will be n branches, one for each element of B. Therefore, there are m • « pairs in A x B. 

5. (a) If couple A is seated, couple B can be either to their left or right and couple C sits in the other position; therefore, there are two possible 

arrangements. 

(b) 2-23=48 

7. (a) 5 != 120 (b) 5 ! - 2-4 != 72. (Here, we subtract the ways that the 

two could be seated together from the total number of arrangements.) 

9.(a)P(10;4) (b) C(10; 4) -^6; 3) 

ll.C(10; 2)-8 = 360 

13.(a)C(ll;5) = 462 (b) C(10; 4) = 210 

(c) C(2; 1)-C(9;4) + C(9;3) 
15.(a)3P(3;2) = 18 (b) 2P(3; 2) = 12 
CHAPTER 3 

Section 3.1 

l.(a) dAc (b)^V^c 

(c) -^(dAs) (d)^iA^c 

3. (a) 2 > 5 and 8 is an even integer. False. 

(b) If 2 s 5 then 8 is an even integer. True. 

(c) If 2 ^ 5 and 8 is an even integer then 1 1 is a prime number. True. 

(d) If 2 s 5 then either 8 is an even integer or 1 1 is not a prime number. True. 

(e) If 2 ^ 5 then either 8 is an odd integer or 1 1 is not a prime number. False. 
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(f) If 8 is not an even integer then 2 > 5 . True . 
5. Only the converse of d is true. 

Section 3.2 

P pV P p ^p pAp p pf\(^p) 

1. (a) 0~ (b) i 0~ (c) i i 

11 10 10 1 

p pl\p 

(d) 0~ 

1 1 

3.(a)-(pA?) V i (b)(pV?) A ('•V?) 
5. = 16 
Section 3.3 

l.a<=>e, d ^ f, g <^ h 

3. No. In symbolic form the question is: Is (p -> pY! 



p 


1 


p^q 


q^p 


(p^q)<r^ 








1 


1 


1 





1 


1 








1 








1 





1 


1 


1 


1 


1 



il^p) 



This table indicates that an impUcation is not always equivalent to its converse. 
5 . Let X be any proposition generated by p and q. The truth table for x has 4 rows 
and there are 2 choices for a truth value for x for each row, so there are 

2 • 2 • 2 • 2 = 2"* possible propositions. 
(See Table 13.6.1 for an illustration.) 
7.0 -> p and p ^ 1 are tautologies. 
Section 3.4 

1. Let s = "I win study", f = "I wiU learn." The argument is: ((j t) A(-> t)) -*(_->s) , call the argument a. 



s 


t 


J -> f 


(j ^ A (-■ t) 


a 








1 


1 


1 





1 


1 





1 


1 











1 


1 


1 


1 





1 



Since a is a tautology, the argument is valid. 



3. In any true statement 5, replace; A with V, V with A, with 1 , 1 with 0, <= with =>, and with «=. Leave all other coimectives unchanged. 
5 . (a) If not EOF then repeat 
Read (ch); 

Count := Count + 1 
Until EOF 

Law used: involution law. Not Not EOF<=> EOF} 

(b) S := 0; K := 1; N := 100; 
IfK <=A'thendo 
Repeat 
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S:=S + K- 
K:=K+l 
VnmK>N 

No Law of logic is really used here, only a law of integers: 
Not (K<N)^K>N. 

Section 3.5 



l.(a) 



p 


? 


(p\/q)A^q iipy q)A^q)^p 








1 





1 


1 


1 





1 1 


1 


1 


1 



p 


q 


ip^q)A^q 


-^p 


(p^q)A(.^q) 








1 


1 


1 





1 





1 


1 


1 











1 


1 


1 








1 



3. (a) Direct proof: 

(1) d^iaVc) 

(2) d 

(3) aVc 

(4) a^b 

(5) -.aV^ 

(6) c^b 

(7) -.cyh 

(8) (^ayb)A{^c\Jb) 

(9) a A ^ c) V 

(10) ^{a\Jc)\Jb 

(11) bu 
Indirect proof: 

(1) -^b 

(2) a^b 

(3) -a 

(4) c^b 
(5) 

(6) (-. a A -. c) 

(7) -(aVc) 

(8) d ^ (a V c) 
(9) 

(10) d 

(11) 
(b) Direct proof: 



Negated conclusion 
Premise 

Indirect Reasoning (1), (2) 
Premise 

Indirect Reasoning (1), (4) 
Conjunctive (3), (5) 
DeMorgan's law (6) 
Premise 

Indirect Reasoning (7), (8) 
Premise 
(9),(10)B 
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(1) (p^q)A(r^s) 

(2) p^q 

(3) (p^t)A(s^u) 

(4) q^t 

(5) p->t 

(6) r^s 

(7) J -» M 

(8) r-^u 

(9) p^r 

(10) p ^ M 

(1 1) p -» (f A m) Use (j: ^ y) A (■« -» z) ^ (y A z) 

(12) -i(tAu)^-ip 

(13) -.(Mm) 



(14) -pB 
Indirect proof: 

(1) P 

(2) p^q 

(3) ? 

(5) ? 

(6) -.(Mm) 

(7) ^ f V -. M 

(8) ^M 

(9) i ^ M 
(10) 

(ll)r^s 
(12) 

(13) p^r 

(14) r 

(15) ■ 
(c) Direct proof: 

(1) -iVp 

(2) 5 

(3) - (- s) 

(4) p 

(5) p^iq^r) 



Premise 

Added premise (conditional conclusion) 
Involution (2) 

Disjunctive simplification (1), (3) 
Premise 
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(6) q^r 

(7) q 

(8) r 
Indirect proof: 

(1) --(.s^r) 

(2) -(-iVr) 

(3) i A -. r 

(4) s 

(5) -iVp 
(6) 

(7) p 
(8) 

(9) q^r 

(10) ? 

(11) r 
(12) 
(13) 

(d) Direct proof: 

(1) P^q 

(2) q^r 

(3) p^?- 
(4) 

(5) -.pyr 

(6) (pVr)A(-pV'-) 

(7) (pA-p)V'- 

(8) V 

(9) rB 
Indirect proof: 

(1) 

(2) pyr 

(3) P 
(4) 

(5) q 

(6) ^^r 

(7) r 

(8) 



Detachment (4), (5) 
Premise 

Detachment (6), (7) I 



Negated conclusion 
Conditional equivalence (I) 
DeMorgan (2) 

Conjunctive simplification (3) 
Premise 

Conditional equivalence (5) 
Detachment (4), (6) 
Premise 

Detachment (7), (8) 
Premise 

Detachment (9), (10) 
Conjunctive simplification (3) 
Conjunction (11), (12) ■ 



Negated conclusion 
Premise 
(1),(2) 
Premise 

Detachment (3), (4) 
Premise 

Detachment (5), (6) 
(1),(7)« 



5. (a) Let W stand for "wages will increase," / stand for "there wiU be inflation," and C stand for "cost of living wiU increase." Therefore the 
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argument is: W ^ I, 
tion of all premises. 



-1 / ^ -1 C, W =^ C. The argument is invalid. The easiest way to see this is through a truth table. Let x be the conjunc- 



w 


/ 


c 


^ / 


^ c 


w ^ / 


^ / ^ ^ c 


.V 











1 


1 


1 














1 


1 





1 


1 








1 








1 


1 


1 








1 


1 








1 


1 





1 








1 


1 











1 





1 


1 








1 





1 


1 








1 


1 


1 


1 


1 


1 


1 








1 


1 


1 



(b) Let r stand for "the races are fixed," c stand for "casinos are crooked," t stand for "the tourist trade will decline," and p stand for "the police 
win be happy." Therefore, the argument is: 

(rV c) -* t, t -» p, -1 p -» -I r. The argument is vaUd. Proof: 

(1) t ^ p Premise 

(2) -1 p Premise 

(3) ->t Indirect Reasoning (1), (2) 

(4) (rV c) Premise 

(5) -.(rVc) Indirect Reasoning (3), (4) 

(6) (-.r)A(-.c) DeMorgan(5) 

(7) -1 r Conjunction simplification (6) ■ 

7. pi P), and Pf. pj^^^ implies p\ Pf.^^ . It takes two steps to get to p\ p^-^j from pi P), This means it takes 2 (100 - 1) steps to get to 
Pi Pioo (subtract 1 because pi p2 is stated as a premise). A final step is needed to apply detachment to imply pioo 

Section 3.6 

l.(a){{l},{3}, {1,3},0} 

(b) {{3}, {3, 4}, {3, 2}, {2, 3, 4}} 

(c) {{!}, {1,2}, {1,3}, {1,4}, {1,2,3}, {1,2,4}, {1,3,4}, {1,2,3,4}} 

(d) {{2), {3), {4), {2, 3}, {2, 4}, {3, 4}} 

(e) {AqU:\A\= 2} 

3. There are 2^ = 8 subsets of U, allowing for the possibMity of 2^nonequivalent 

propositions over U. 

5. s is odd and (s -l)(s- 3) (i - 5) (i - 7) = 
7. b and c 

Section 3.7 

1 . We wish to prove that P(n) : 1 + 3 + 5 + • • • + (2 - n) = is true forn & 1 . Note: The nth odd positive integer is 2n - 1 . 
Basis: forn = 1 : 1 = 1^ 

Induction: Assume that for somen & 1, p(_n) is true. Then: 

1 + 3 + ••• + (2(n + 1) - 1) + 1 = [1 + 3 + ••• + (2n - 1)] 
+(2(« + l)-l) 

= n^ + (2 n + 1) by pin) and basic algebra 
= (n + 1)2 ■ 

3. Proof: (a) Basis: 1 = 1 (2) (3)/6 = 1 
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n+1 n 

(b) Induction: 2 = Z*^ + (« + 1)^ 

1 1 



6 

(n+l)(2n2+7«+6) 



6 

(n+l)(n+2)(2n+3) 



5. Basis: For n = 1 , we observe that 



(1-2) (1+1) 

Induction: Assume that for some n s 1 , the formula is true. 
Then:^ + -+ ' " . i 



(1-2) ((n+l)(n+2)) (n+1) ((n+l)(n+2)) 

1 



: (n + 2) (n) + 



((n+l)(n+2)) 

(n+1)" 



((,1+1) («+2)) 

("+1) ^ 

(n+2) 



7. Let A„ be the set of strings of zeros and ones of length n (we assume that \A„\ = 2" is known), E„ = the even strings, and E'^ = the odd strings. 
The problem is to prove that for n s 1, \E„\ = 2""'. Clearly, l^'l = 1, and, if for some n s 1, \E„\ = 2""', it follows that \E„+i \ = 2" by the 
following reasoning: 

E„,,={ls:smE'„]U{Os:smE„] 

Since [Is: J inE^) and {Os : sinE„] are disjoint, we can apply the addition law. Therefore, |£„+i| = \E'^\ + \E„\ 

= 2"-i + (2" - 2"-i) = 2". ■ 

9. Assume that for n persons (n ? 1), " handshakes take place. If one more person enters the room, he or she will shake hands with n 
people,^+n=^^ = ^ 

_ ((n+l)-l)(«+l) 
2 

Also, forn = 1, there are no handshakes: =0. ■ 

2 

1 1 . Let p(_n) be "ai + 02 + • • • + a„ has the same value no matter how it is evaluated." 



Basis: ci +02 + 03 may be evaluated only two ways. Since + is associative, (a 1 + 02) + 03 = ai + (02 + a3).Hencep(3)istrue. 

Induction: Assume that for some n 3= 3 p(3), p(4), . . . , p(n)are all true. Now consider the sum + 02 + ••• + a„+i. Any of the n 
additions in this expression can be applied last. If the jth addition is applied last, we have cj = {ai + 02 + ■■■ + ajj + (aj+i + ••• + a„+i). No 

matter how the expression to the left and right of the addition are evaluated, the result will always be the same by the induction hypothesis, 
specifically p(J) and p(n + 1 - j). We now can prove that ci = C2 = ■■■ = c„. H i < j, 

C; = (ai + 02 + ••• + a,) + ((a;+i + ••• + a,) + (a,+ i + ••• + a„+i) 

= (ci +02 + ■■■ +ai) + ((oi+i + ••• +aj) + [oj+i + ■■■ +a„+i) 

= ((ai + ••• + a,) + (fli+i + ■■■ + aj)) + (aj+i + ■■• + a„+i) by p(3) 
= Cj 

Ci = (ai + 02 + ■■■ + a,) + (a,+i + • • • + a^- + ay+i + • • • + a„+i) definition of c,- 
= (ai +02 + ••• + a,) + ((aj+i + ••• + ay) + (a^+i + ••• + a„+i) by p(n + \ - i) 
= ((ai + ■ ■ ■ + a,) + (a,+i + ■ ■ ■ + aj)) + [aj+i + ■■■ + a„+i) by /7(3) 

= (ai + • • • + a,- + a,+i + • • • + ay) + [aj+i + ■■■ + a„+i) by 
= Cj definition of cj ■ 

13. For m 3; 1, let p(m) be x"'^"' = x" for aH n » 1. The basis for this proof follows direcfly from the basis for the definition of exponentiation. 

Induction: Assume that for some m & 1, p(m) is true. Then 

jji+(m+i) _ j.(n+m)+i ),y associativity of integer addition 
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= x"*"* by recursive definition 

= x" :>^x^ inductive hypothesis 

_ j^+i recursive definition ■ 

Section 3.8 

I. (a)(Vx)(FW^G(x)) 

(b) There are objects in the sea which are not fish. 

Every fish lives in the sea. 

3. (a) There is a book with a cover that is not blue. 

(b) Every mathematics book that is pubUshed in the United States has a blue cover. 

(c) There exists a mathematics book with a cover that is not blue. 

(d) There exists a book that appears in the bibUography of every mathematics book. 

(e) Ofx)(B(x) -^M(x)) 

(f) (3x)(M(x)A-f/W) 

(g) (3x)((Vj)(-/?(x,j)) 

5. The equation 4 - 9 = has a solution in the integers. (False) 

7. (a) Every subset of U has a cardinality different from its complement. (True) 

(b) There is a pair of disjoint subsets of U both having cardinality 5. (False) 

(c) A - B = - A*^ is a tautology. (True) 
9.(V a)Q (V b)Q{a + i is a rational number.) 

II. Let / = {1,2,3, ...,n} 

(a) (3!)/(xeA,) 

(b) (V!)/(xeA,) 
Section 3.9 

1 . The given statement can be written in if . . . , then . . . format as: If x and y are two odd positive integers, then x + yism even integer. 

Proof Assume x and y are two positive odd integers. It can be shown that x + y = 2 -(some positive integer). 
X odd X = 2 m + 1 for some meP, 
y odd y = 2 n + 1 for somen e P. 

Therefore, x + >' = (2m + 1) + (2n + 1) = 2 ((m + «) + !) = 2-(some positive integer) so x + y is even. ■ 

3. Proof: (Indirect) Assume to the contrary, that is a rational number. Then there exists p, q e^, (q ^ 0) where - = and where 

- is in lowest terms, that is, p and q have no common factor other than 1 . 

? 

- = V2~ => ^ = 2=>p^ = 2g^=> p^is an even integer => p is an even integer (see Exercise 2) 4 is a factor of p'^ =^ is even =^ q is even. 
Hence both p and q have a common factor, namely 2. Contradiction. ■ 

5. Proof: (Indirect) Assume x, y e R and x + y^l. Assume to the contrary that (xs^orys^) is false, which is equivalent to 
X > ^ and y > ~ - Hence x + >■ > ^ + ^ = 1 . This contradicts the assumption that x + >> s 1 . ■ 
Supplementary Exercises— Chapter 3 
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p 


py p p 


-^p 


pA^p 


l.(a) 







(b) 


1 







1 




1 1 








P 




p 


pV -'P 


p 


pAp 


(c) 


1 


1 


(d)0 





1 





1 


1 


1 



p 




p 


qA^p 


pViqA^p) 








1 











1 


1 


1 


1 


1 











1 


1 


1 








1 



5. Let a = p ^ -t q, b = q\/ r, and c = (p ^ ^ q) /\ (q\J r) l\ ^ r 



p 


q 


r 


-^q 


a 


b 


a [\b 


r 


c 


p 


c -> -1 p 











1 










1 





1 










1 


1 




1 


1 








1 







1 










1 


1 


1 


1 


1 







1 


1 







1 


1 








1 




1 








1 










1 










1 





1 


1 




1 


1 













1 


1 











1 





1 










1 


1 


1 








1 

















7. An implication is always equivalent to its contrapositive, as can be seen in the table below. 



p 


q 


p^q 


^q^ 


p 


(p^q)<r^{-^q=^^p) 








1 


1 









1 


1 


1 






1 















1 


1 


1 


1 






9. The truth tables of p 


(p A q) and x must be equal. 


P 


q 


p^(p Aq) 


X 








1 




1 





1 


1 




1 


1 













1 


1 


1 




1 



1 1 . (a) "3 is not a prime number or it is odd," or "3 is a composite number or it is odd." 

(b) "4 is not a prime number and it is even," or "4 is a composite number and it is even." 

(c) I can exhibit an example of a statement and I cannot prove it. 

(d) - 7x+ 12 = 0andx + 3 andx* 8 



13. (a) 



p 


q 


p^q 


^q^^p 


(p -» ?) <^ (-. 9 ^ -. 








1 


1 







1 


1 


1 




1 













1 


1 


1 


1 
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p 


1 




p^q 


q^p 


(p^q) /\{q^ p) 








1 


1 


1 


1 





1 





1 








1 











1 





1 


1 


1 


1 


1 


1 



Columns 3 and 6 are the same so (p <-> « [(p q) A(q ^ p)] is a tautology. 
15. Let q = "I quit my job," s = "will starve," and w = "I did my work." In 
symbolic form, the argument is ^ j) A w -» ^) A w The truth 

table for x =(_q ^ j) A w A w ^ -> j does not consist of all ones. 
For example, when q is false, s is true and when w is true, x is false. Therefore, 
X is not a tautology and the argument is not valid. 
17. [( m ^ p) A (e V -'P) A «]=!>-"«• Valid 
Proof: (Direct) 



(1) 


e\l ^p 


Premise 


(2) 


-1 e 


Premise 


(3) 


P 


(1), (2), disjunctive simplification 


(4) 


m ^ p 


Premise 


(5) 


m 


(3), (4), indirect reasoning ■ 


19. L(- 


p^^qA 


A(P ^ s) A(qy r)^ s. Valid 


Proof: (Direct) 




(1) 


-^p^^q 


Premise 


(2) 


q^p 


(1), Contrapositive 


(3) 


p ^ s 


Premise 


(4) 


q -> s 


(2), (3), Chain rule 


(5) 


qyr 


Premise 


(6) 


-1 r 


Premise 


(7) 


q 


(5), (6), Disjunctive simplification 


(8) 


s 


(4), (7), Detachment ■ 


(Indirect) 




(1) 


-1 s 


Negated conclusion 


(2) 


p s 


Premise 


(3) 


^p 


(1), (2), Indirect Reasoning 


(4) 


^p^^q 


Premise 


(5) 


^q 


(3), (4), Detachment 


(6) 


qyr 


Premise 


(7) 


r 


(5), (6), Disjunctive simplification 


(8) 


-1 r 


Premise 


(9) 





(7),(8)B 
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21. e-*i,i->d, d-^w=^e-iw 
Proof: 

(1) e -> ! Premise 

(2) i -> d Premise 

(3) e^d (1), (2), Chain rule 

(4) -> w Premise 

(5) e^w (3), (4), Chain rule ■ 

23 . Valid. Statement: t\/ d, ->e\/ j, -> j y r =^ -> t ^ r 
Proof; (direct) 

(1) ty d Premise 

(2) -'t-td (1), Conditional equivalence 

(3) V j Premise 

(4) d -> j (3), Conditional equivalence 

(5) jy r Premise 

(6) r (5), Conditional equivalence 

(7) -.f^r (2), (4), (6), Chain rule ■ 
25. (1) First show T^/^, c fl 

a e Tp^^ => a makes p Ag true 

=> a makes p true and a makes q true 
=>a &Tp and a 

(2) To prove TpOT^c Tp/^^ reverse the above steps. ■ 

27.60 = 6-10 = 2-3-2-5 = 22-3-5 
120 = 2-60 = 23-3-5 

\k- l)*\k)~ (n-»-l))!tt-l)! {n-k)\k\ 



(n-t+l)!(/fe-l) (»-*)!« 
n\k + n\(ji-k+l) 
(n-/fc+l)!« 

n!(n+l) (n + 1)! 



(n-ifc+l)!« (n+l-/fc)!H 
n + 



(b) Basis: (n = 1) : (jc + j)' = jc + j . 

Induction: Assume n^l and (x + y)" = 2 x""* >>* 
We will now prove {x + = ( " ^ ^ ) -^^^'^ 
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(x + =(,x + y)(,x + y)" 

= (.x + y) 1^=0 ( ^ ) * ^ induction hypothesis 

= X Z^=o { 1 ] y'^ + y I*=o f I ] / distribution 



kyk 



Let A: = - 1 in the second summation; remember to increase top. 

_ V« i ^ \ -Jl+^-k . v«+l I 1 v^-k+\ . 

M:)=("r)-o=(:::)- 



/t=0 ^ " ' 

31 . - (3 X) ((V y) (D(x) A T(y))) ^ (V x) (V j) A ^(j))) 
^(Vx)((3j)(- (D(x)Ar(j)))) 
^(Vx)((3y)(-D(x)V-7'(y))) 

All sailing is not dangerous or some fishing is 
not tedious. 

33. (a) Let U be the universe of all fish, ="x is kind to children," and six) ="x is a shark." (V x)u (-> six) -» 

(b) Let w(x) = "x is a wine drinker," 

c(x) ="x is very communicative," 
pix) = "x is a pawnbroker," 
and /!(x) = "x is honest" ; 

then (V X) ((3 y) ((wW ^ cW) V (pW ^ iKx) A - My))))) 

(c) Let pix) = "x is a clever philosopher," 

c(x) = "x is a cynic," 
and wix) = "x is a woman"; then 
(V X) ((3 y) iiipix) ^ cW) A iwix) ^ pW)) ^ (p(y) ^ (wCy) ^ c(y))))) 
35. (V a)K + (V + (3 n)p (na > b) 

CHAPTER 4 

Section 4.1 

1. (a) Assume thatx e A (condition of the conditional conclusion A c C). Since A c B, x e B by the definition of c. Be Cand x e B impUes 
that X e C Therefore, if x e A, then x e C. ■ 

(b) (Proof that A - B c A fl B') Let x be in A - B. Therefore, x is in A, but it is not in B; that is, x e A and x e B'' =* x e A fl B''- ■ 

(c) ( => ) Assume that A c B and A c C. Let x e A. By the two premises, x e B and x e C. Therefore, by the definition of 

intersection, x e B H C. ■ 

(d) (=>) (Indirect) Assume that AcC and 6'' is not a subset of A'' . Therefore, there exists xeB*^ that does not belong to A'^. 
X ^ A' => X e A. Therefore, x e A andx ^B,il contradiction to the assumption that A c B. ■ 

3.(a)If A = ZandB = 0,A - B = Z, while B - A = 0. 
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(b) IfA = (01andB = (1),(0, 1) e A xB,but(0, 1) is not in BxA. 

(c) LetA = 0,B = |0),andC = jl). 
5 . Proof: Let p («) be 

An(BiUB2U---UB„) 

= (An5,)U(AnB2)U---U(An5„). 

Basis: We must show that p(2) : A f] (Bi U ^2 ) = (A fl 5i) U H ^2) is true. This was done by several methods in section 4.1 . 
Induction: Assume for some n>2 that p(n) is true. Then 

A n (Bi u B2 u ■ ■ ■ u B„-n) = A n [(Si u S2 u ••• u 5„) u 
= (A n (5i u S2 u ••• u 5,,)) u (A n 

by p(2) 

= ((A n Si) u ••• u (A n 5,,)) u (A n 

by the induction hypothesis 

= (AnBi)U ••• U(AnB„)U(AnB„+i) ■ 

Section 4.2 

l.(a) 







B 

) 














a 










Wr BinC 






AUB 


(A U Bf 


1 





1 





1 





1 


1 








1 







(b) A B A'^ 

1 
Oil 
10 
110 

The last two columns are the same so the two sets must be equal. 

(c) (i)x e A U A => X e A or X e A by the definition of f] 

=> X e A by the idempotent law of logic 
Therefore.AlJAcA. 

(ii) X e A => X e A and x e A by conjunctive addition 

=> X e A U A 
Therefore, A c A IJ A and so we have A IJ A = A. ■ 

3. For all parts of this exercise, a reason should be supplied for each step. We have supplied reasons for part a only and 
left them out of the other parts to give you further practice. 

(a) A U (B - A) = A U (B n A' ) by Exercise 2b of Section 4.1 

= (A U B) n(A U A') by the distributive law 
= (A U B) n by the null law 

= (A U B) by the identity law ■ 
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Q)) A - B = A f]B'^ 

= B'' n A 

= - A<^ 

(c) Select any element, x, in A H C. One such element exists since A H C is not empty. 

xeAnC=>xeA and x e C 
=> X e B and x e C 
=>xeB n C 
^BnC*0 ■ 

(d) An(B-C)= AfKBnC') 

= (A n B n A') u (A n B n c<^) 
= (A n B) n (A'' u 
= (A n B) n (A u c)'' 

= (A-B)n(A-C) ■ 

(e) A-(BUC) = An(BUCr 

=An(B'nc'^) 

= (,Af]B')f](.Af]C') 
= (A-B)n(A-C) ■ 

3 1 2 1 3 2 2 3 1 

5. (a) A U B" n C (b) A n B U C n B (c) A U B U C<^ 

Section 4.3 

1. (a)(1), (2, 3, 4, 5), (6), (7, 8), (9, 10) 

(b) 2^ , as compared with 2'°. (1,2) is one of the 992 sets that can't be generated. 

3. Bi = {00, 01, 10, 11} andBj = {0, 00, 01} generate minsets {00, 01}, {0}, {10, 11}, and {A, 1}. Note: X is the nuU string, which has 
length zero. 

5. (a) B, n Bz = 

Bi n B'z = {0, 2, 4) 

B^nB2 = {l,5) 

B'ir\B'2 = {i) 

(b) 2^ , since there are 3 nonempty minsets. 

7. Let a e A. For each i, a e Bi, or a e Bf , since B, U Bf = A by the complement law. Let £); = B, if a e B,, and D = B,'' otherwise. Since a is in 
each Di, it must be in the minset Di fl ^2 ••■ fl D„. Now consider two different minsets Mi = DiC\ D2 ■■■ C\ D„, and M2 = Gi fl G2 ■■■ Pi G„, 
where each D, and G, is either B, or Bi". Since these minsets are not equal, D,- i= Gi, for some Therefore, 
M] n = n D2 ■■■ f] D„ f] n C2 ••• n C„ = 0, since two of the sets in the intersection are disjoint. Since every element of A is in a 
minset and the minsets are disjoint, the nonempty minsets must form a partition of A. ■ 

Section 4.4 

l.(a)An(BUA)=A 

(b) A n((B'^nA)UBr=0 

(c) (A n B'y U B = A'^ U B 

3. (a) (p A-(- q Ap)Vg)) «0 

(b) i^ip V Aq)^(i^p) Aq) 

5. The maxsets are: 

BiUB2 = {l,2, 3,5) 
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BiUB2'^ = {l,3,4, 5,6} 
Bi"UB2 = {l,2,3,4, 6} 

B,'UB2" = {2,4,5,6) 

They do not form a partition of A since it is not true that the intersection of any two of them is empty. A set is said to be in maxset normal form 
when it is expressed as the intersection of distinct nonempty maxsets or it is the universal set U. 

Supplementary Exercises— Chapter 4 

1. (a) Proof: (=>) (Indirect) Assume A c B and A\J {U - B) i^0.To prove that this cannot occur, let e A H (t^ - B). 
X e A n (i/ - B) 

Definition of complement => e A p) B'^ 
Definition of H => e A and a; e 

Definition of complement x e A and x 

Definition of subset => A is not a subset of B 

This contradicts the premise that A c B. Hence this part of the statement is proven. 

(<=) (Indirect) Assume Af](U -B) = 0, and A is not a subset of B. To prove that this cannot occur, let x e A such 
that x^B. 

X e A and x ^ B 

Definition of complement ^ x e A and x e B*^ 
Definition of fl x e A fl 5' 

Definition of complement => xeAf](U - B) 
Definition of disjoint ^Af]{U-B)t0 

But this cannot happen because it contradicts the assumption that A f] (U - B) = 0. Hence this part of the statement is proven and the proof is 
complete. 

(b) Proof: (Indirect) Assume U = A\jB,AC\B = 0, and Ai=U -B. One way in which A and U -B can be not 
equal is that A is not a subset of U - B. Let x e A and x^U - B. 

X e A and xfU-B 

Definition of complement => x e A and x e B 

Definition off! =^ x e Af} B 
Definition of disjoint A fl B + 

But this caimot happen because it contradicts the assumption that AC\B = 0. The other way A and U - B can differ is if U -B is not a 
subset of A, Let x $ A and x&U - B. We could infer from this assumption that xtAljB. Therefore, any way that we assume that A + U -B 
leads to a contradiction. 

(c) Proofi (=>) (Direct) Let x e A. 
A and B are disjoint 
Definition of disjoint x $ B 
Definition of complement => x e B*^ 

Therefore, A cB'' 

(<=) (Indirect) Assume that A c B'' and x e A p] B- 
xeAflfi 

Definition of intersection x e A and x e B 

Definition of complement =* x e A and x ^ B' 
Definition of subset => A is not a subset of B ■ 

3. (a) Proofi (Direct) Let A, B, and C be sets. 
Let(x, >•) e(A \JB)xC. 

Definition of Cartesian product x e (A IJ S) and y & C 

Definition of U => (x e A or x e B) and j e C 

Distributive law of logic => (x e A and y e C) or (x e B and y e C) 

Definition of Cartesian product ((x, y) e A x C) or 

((x, y) eBxC) 

Definition of U (x, y) e (A x C) U (B x C) ■ 

(b) We proved (A U B) x C c (A x C) U (B x C) in part a; we now must show (A x C) U (B x C) c (A U B) x C and we wiU be finished. 

5. Proofi (Indirect) Assume A, B, and C are subsets of ?7, A c B, B c C and is not a subset of A'^ . To prove that this carmot occur, let x e 
and X ^ A"^ by definition of subset. 
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X e C and x ^ A' 
Definition of complement 
Premise 

Definition of subset 
Premise 

Definition of subset 



=> x^C and x e A 
^ AcB 
^ xgB 
=>BcC 

=> X e C (Contradiction) ■ 
7. (a) Proof: (Indirect) Let A, B, and C be sets. Assume A U C + B U C and A = B. 



A=B^AqB 

xeAUC 

Definition of union =;> xeAorxeC 
Definition of subset =>xeBorxeC 
Definition of union => x e B IJ C 

Therefore, A\JCcB\JC.Bya similar line of reasoning we can infer B IJ C c A IJ C, which proves that 
AUC = BUC, a contradiction. 

(b) Proof: (Direct) Assume Ai=B and show A' i= B'^ . Since Ai^Bvit can assume that A is not a subset of B. The 
alternative is that B is not a subset of A and the remaining logic would be identical. 

A not a subset of B 

Definition of subset => x e A and x ^ B 
Definition of complement x ^ A'' and x e B' 

Definition of subset => B' is not a subset of A' 

Definition of inequaUty =^ A'^ i^B^ ■ 

9. (a) The minsets are Bi fl B2 = {3}, Bi<^ 0^2 = {2, 5}, Bj fl B2' = {!}, and Bi<^ fl = {4, 6} 
(b) The minsets are disjoint and 

(Bi n S2) u {Bi' n B2) u (Bi n 52"^) u (Bi'^ n B20 = j/, 

so the minsets form a partition of U. 

CHAPTER 5 

Sections 5.1-5.3 

1 . For parts c, d and i of this exercise, only a verification is needed. Here, we supply the result that will appear on both sides of the equality. 



(c) 



3 
15 



(f) B + = B 



V 1/3 j 



(d) 
(g) 



18 
-39 

CD «(o2) <"(,o:5) 



-15 
35 



15 
-35 



(e) 



-12 
5 



5 -5 
■25 25 



7. (a) Ax 



'10 
8 
,0 27 J 

_ /" 2 Xl + 1 X2 

V 1 Xi - 1 X2 

2xi + X2 = 3 andxi 



( 1 



equals 

-X2 = 



(b) (i) 



(ii) 



(iii) A = 



-1 
1 

112 
1 2 -1 

.1 3 1 
C 1 1 \ 
1 

.10 3, 





32 768 


3 
1 

1 are true 

xi ■ 

X2 

' \ 
X2 

(Xi 

yx3 







14348 907 ) 
if and only if both of the equaUties 



B = 



1 ^ 

-1 

5 

3 
5 
6 



Section 5.4 
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^'^^^(2/5^ - //^5 ) ( 1) No inverse exists. 

(-1/3 
(d) A-i = A (e) 2 

. -1/5 ; 

3. Let A and B be n by n invertible matrices. Prove (AB)"' = B"' A"^. 

Proof: (B-^ A-') (AS) = (fi-') (A-'(AS)) 

= (B-i)((A-iA)B)) 
= (B-i)(/B) 
= (B-')(6) 
= / 

Similarly, (AB) (B"' A ') = /. 

By Theorem 5.4.1, B"' A"' is the only inverse of AB, If we tried to invert AB with A"' B^^ we would be 
vmsuccessfiil since we cannot rearrange the order of the matrices. 

5 . (b) 1 = det / = det(AA~') = det A det A"' . Now solve for det A"' . 

7. Basis: (n = 1) : det A' = det A = (det A)'. 

Induction: Assume det A" = (det A)« for some n > 1 . 

det A"*' = det(A" A) by the definition of exponents 

= det(A") det(A) by exercise 5 

= (det A)" (det A) by the induction hypothesis 

= (detA)"+i 

9. (a)AssumeA = B£)B"' 

BasiS:(m = 1): A'\ 1 = Bfl' B"' is given. 

Induction: Assume that for some positive integer m. A" = BD" B~' 

= (BD™ B"') (BDB"') by the induction hypothesis 

= BU" DB~^ by associativity, definition of inverse 



(b) A'" = BD'" B 

Section 5.5 



= Bzr+' B-i 

9206 15 345 
6138 10231 



I. (1) Let A and B be m by n matrices. Then A + B = B + A, 

(2) Let A, B, and C be m by n matrices. Then A + (B + C) = (A + B) + C. 

(3) Let A and B be m by n matrices, and let c e R. Then c(A + B) = cA + cB, 

(4) Let A be an m by n matrix, and let Ci , C2 e R. Then (ci + C2) A = Cj A + C2 A. 

(5) Let A be an m by n matrix, and let ci , C2 e K. Then ci(c2 A) = (ci C2) A 

(6) Let be the zero matrix, of size m by n, and let A be a matrix of size n by r. Then A = = the m by r zero matrix. 

(7) Let A be an m by n matrix, and = the number zero. Then A = = the m by n zero matrix. 

(8) Let A be an m by n matrix, and let be the m by n zero matrix. Then A + = A. 

(9) Let A be an m by n matrix. Then A + (-l)A = 0, where is the m by n zero matrix. 

(10) Let A, B, and C be m by «, « by r, and n by r matrices respectively. Then A(B + C) = AB + AC. 

(1 1) Let A, B, and C be mby n, rby m, and r by m matrices respectively. Then (B + C) A = BA + CA. 

(12) Let A, B, and C be m by n, n by r, and rby p matrices respectively. Then A(BC) = (AB) C. 

(13) Let A be an m by n matrix, /„ the m by m identity matrix, and /„ the n by n identity matrix. Then I„A = AI„ = A 

(14) Let A be an n by n matrix. Then if A"' exists, (A"')"' =A. 

(15) Let A and B ben by n matrices. Then if A"' andB"' exist, (AB)"' =B"'A"'. 

/ 21 5 22 

3.(a)AB +AC = 



-9 -6 
(b) A(B + C) = AB + AC 
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(c) A 



(d) (A2)-i = (AA)-i = (A4-1) '=/-'=/ by part c 

Section 5.6 

1 . In elementary algebra (the algebra of real numbers), each of the given oddities does not exist. 

(i) AB may be different from BA . Not so in elementary algebra, since ah = ha by the commutative law of multiplication. 

(ii) There exist matrices A and B such that AB = 0, yet A and B i=O.Ia elementary algebra, the only way ab = is if either a or is zero. 
There are no exceptions. 

(iii) There exist matrices A,AtO, yet A^ = 0. In elementary algebra, = « a = 0. 

(iv) There exist matrices A^ = A. where Ai=0 and A=f= I.ln elementary algebra, a^=a<=>a = Oorl. 

(v) There exist matrices A where A^ =1 but Ai=I and A -/. In elementary algebra, a^ = l«a = lor-l. 
3. (a) det A => A~' exists, and if you multiply the equation A^ = A on both sides by A~' , you obtain A = /. 

( 1 \ 

(b) Counterexample: A = I ^ ^ I 
5.(a)A-i=(JJ^ ^^^^^ xi=4/3,andx2 = l/3 

(b) A-i=(J ";',] ^i=4,and^2 = 4 

(c) ^"'=i;',; o/'^J xi=2/3,andx2 = -l/3 
xi = 0, and X2 = l 

(e) The matrix of coefficients for this system has a zero determinant; therefore, it has no inverse. The system caimot 
be solved by this method. In fact, the system has no solution. 



1/3 


1/3 \ 


1/3 


-2/3) 


1 - 




1 - 


i) 


1/3 


1/3 \ 


1/3 


-2/3 j 


1/3 


1/3 \ 


1/3 


-2/3 j 



Supplementary Exercises— Chapter 5 

(x + y 5 \l 3 5\ (x + y = 3 (y = -l/2 
'[ -2 x-yj 1-2 41 \x-y = 4 \x = 7/2 
3. For n > 1 let pin) be AS" = B" A 



BasiS:(n = 1): AB' = 6' A is true as given in the statement of the problem. Therefore, p(l) is true. 

Induction: Assume n > 1 and p (n) is true. 

AS"*' =(AB")B 

= (B" A)B By the induction hypothesis 
= (B''B)A Bypd) 



B''*^A 



7. D has no inverse if det D = 0. 
detD = <^ 3 c - /(15) = 3c-60 = 0<=>c = 20 

9. (a) (A + Bf =A^+AB + BA + B'^ 

(b) (A + Bf=A^ + 2AB+B^ only if AB = BA. 

I \ /O 1 



11 \ 10 I \ 

1 1 . The implication is false. Both I ^ I and I ^ q self-inverting, but their product is not. 

(a h \ ~ 

T also solve A'- =1 

{\-aflb -a) 



CHAPTER 6 
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Section 6.1 

1. (a) (2, 4), (2, 8) (b) (2, 3), (2, 4), (5, 8) (c) (1, 1), (2, 4) 
3. (a) r = |(l,2), (2,3), (3,4), (4,5)1 

(b) P = ((1,3), (2,4), (3, 5)1 = ((X, J) : j = x + 2, x,:veA) 

(c) r3 = |(l,4), (2, 5)1 = ((x, >!) : >' = x + 3, X, J e A) 

5. (a) When n = 3, there are 27 pairs in the relation. 

(b) Imagine building a pair of disjoint subsets of S. For each element of S there are three places that it can go: into the first set of the ordered 
pair, into the second set, or into neither set. Therefore the number of pairs in the relation is 3", by the product rule. 

Section 6.2 

1. 



3. See Figure 13.1.1 of Section 13.1. 

5. A Hasse diagram cannot be used because not every set is related to itself. Also, {a} and {b} are related in both directions. 
Section 6.3 







(c) The graphs are the same if we disregard the names of the vertices 
3. (a) (i) 



(iv) 



(vii) 



reflexive (ii) 
not symmetric 
not antisymmetric 
transitive 

not reflexive (v) 
symmetric 
antisymmetric 
transitive 

not reflexive 
not symmetric 
not antisymmetric 
not transitive 



reflexive (iii) 
not symmetric 
antisymmetric 
transitive 

reflexive (vi) 

symmetric 

not antisymmetric 

transitive 



not reflexive 
not symmetric 
not antisymmetric 
not transitive 

reflexive 
not symmetric 
antisymmetric 
transitive 
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(b) Graphs ii and vi show partial ordering relations. Graph v is of an equivalence relation. 
5. (a) No, since for example | 1 - 1 | =0^2 

(b) Yes, since \ \ = \j-i\ 

(c) No, since | 2 - 4 | = 2 and | 4 - 6 | = 2, but | 2 - 6 | =4 + 2. 
(d) 

Gosc:© 

7. (b) c(0) = |0), c(l) = (1, 2, 3) = c(2) = c(3) 

(c) c(0) U c(l) = A and c(0) fl c(l) = 

(d) Let A be any set and let r be an equivalence relation on A. Let a be any element of A. a e c(a) since r is reflexive, so each element of A is 
in some equivalence class. Therefore, the union of all equivalence classes equals A. Next we show that any two equivalence classes are either 
identical or disjoint and we are done. Let c(a) and c (b) be two equivalence classes, and assume that c{a) f] c(b) 4= 0. We want to show that 
c(a) = c(h). To show that c(a) c c{b), letx e c(a). x e c(a) => arx. Also, there exists an element, y, of A that is in the intersection of c (a) and c (b) 
by our assumption. Therefore, 

ary and bry => ary and yrb (r is symmetric) 
=> arb (transitivity of r) 

Next, 

arx and arb => xra and arfc 
=> xrii 

=> X e c{b) 
Similarly, c(fo) c c(a). ■ 

9. (a) Equivalence Relation 

c(0) = |0), c(l) = ID, c(2) = (2, 3) = c(3), c(4) = |4, 5| = c(5), 
c(6) = |6, 7) = c(7) 

(b) Not an Equivalence Relation 

(c) Equivalence Relation 

c(0) = {0, 2, 4, 6) = c(2) = c(4) = c(6) 
c(l) = {1,3,5,7) = c(3) = c(5) = c(7) 

1 1 . (b) The proof follows from the biconditional equivalence in Table 3.4.2. 
(c) Apply the chain rule, 
(d) 

I 

1 

i 

I 

Section 6.4 
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□ 


4 


5 


6 


1 


fO 





\ 


2 


1 








3 





1 





4 


.0 





1 , 



and 



□ 6 7 8 

4 (-0 ^ 

5 10 

6 U 1 , 



(b) 



□ 


6 


7 


8 


1 


fO 





^ 


2 











3 


1 








4 


,0 


1 


J 



3. R : xry if and only if | x - y | = 1 . 
S : xsy if and only if x is less than y. 

5. The diagonal entries of the matrix for such a relation must be 1. When the three entries above the diagonal are determined, the entries below 
are also determined. Therefore, the answer is 2^. 



□ 


1 


2 


3 


4 




□ 


1 


2 


3 


4 


1 


rO 


1 





> 




1 


( 1 





1 


^ 


2 


1 





1 





and 


2 





1 





1 


3 





1 





1 




3 


1 





1 





4 


.0 





1 


. 




4 


,0 


1 





1 , 



(b) PQ = 



□ 


1 


2 


3 


4 


1 


(0 


1 





^ 


2 


1 





1 





3 





1 





1 


4 


,0 





1 


. 


□ 


1 


2 


3 


4 


1 


fO 


1 





> 


2 


1 





1 





3 





1 





1 


4 


,0 





1 


, 



9. (a) Reflexive: Rij = Rij for all i, j, therefore Ry < Rij 

Antisymmetric: Assume ify < Sy and Sy < Ry for all 1 < j <.n=^ R^ < Sy 

Transitive: Assume if, S, and T are matrices where ify < Sy-and Sy < Ty, for all 1 < ;', j < n. Then Ry < Ty for all 1 < j < n, and so 

/?<r. 

(b) (R% = Ru Rlj + Ri2R2j+-+ Rin Rnj 

< Su Si j + Sa S2j+-+ Sin S„j = (S% ^R^<S^ 

To verify that the converse is not true we need only one example. For n = 2, let R12 = 1 and all other entries equal 0, and let S be the 
zero matrix. Since R^ and are both the zero matrix, R^ < 5", but since Rn > S]2, R < Sis. false. 

(c) The matrices are defined on the same set A = [ai, 02, ■■■ , a„}. Let c(ai), / = 1, 2, ... , n be the equivalence classes defined by R and let 
d(a,) be those defined by S. Claim: c(a,) c d(ai). Let aj e ciai) => a; raj => Ry = 1 => Sy = 1 => a; saj aj e diai). 

Section 6.5 

3. (a) 
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(b) Example, 1^4 and using S one can go from 1 to 4 using a path of length 3. 

5. (a) Dsf initiori: Reflexive Closure. Let r be a relation on A. A reflexive closure of r is the smallest reflexive relation that contains r. 

Theorem: The reflexive closure of r is the union of r with x) : x e A) 

7. (a) By the definition of transitive closure, r+ is the smallest relation which contains r; therefore, it is transitive. The transitive closure of 
, is the smallest transitive relation that contains r+. Since r+ is transitive, (r+)+ = r+. 

(b) The transitive closure of a symmetric relation is symmetric, but it may not be reflexive. If one element is not related to any elements, 
then the transitive closure will not relate that element to others. 

Supplementary Exercises— Chapter 6 

1 . If Andy is the parent of Barbara and Barbara is the parent of Charles, then Andy is the grandparent of Charles. 
3. (a) r = ((-l,0), (0, 1),(1,2)) 

(b) s = ((-1, -1), (-1, 1), (0, 0), (1, -1), (1, 1), (2, 2)1 

(c) t=[(-\, 0), (-1, 1), (-1, 2), (0, -1), (0, 1), (0, 2), 

(1,-1), (1,0), (1,2), (2,-1), (2,0), (2, D) 

5. His main office should be at node 2. The least desirable location is at node 1. The arrows in both directions between nodes 1 and 2 represent 
a two-way street. 

7. (a) No. 

(b) Person a is friendly toward the most people so he/she would be chair person. 

(c) If "great personality" has any effect then person b becomes chairperson. 

(d) A seating arrangement does not exist, since persons c and d are only friendly toward one person each and they have to be seated 
between fM'o people they are friendly toward. 

9. In order for the relation "living in the same house" to be an equivalence relation we must assume that a person lives in only one house. 
1 1 .(a) r is an equivalence relation. 

(b) s is neither since s is not reflexive. 

(c) In order for ^ to be a partial ordering we rephrase it slightly; xsy iff x taller than y or x equals y. Why would xsy iff x is the same height 
as or taller than y be wrong? 

13. There are 16 places in the adjacency matrix for a relation on four elements, but for a symmetric relation those entries below the diagonal 
will be the same as above. Hence we are only concerned with 16 - 6 = 10 places. Each of the remaining entries may take on a value of either 
or 1 , so by the rule of products we have 2'" possible symmetric relations on a four element set. 
15. 
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15. (a) 



17. 



(b) 



(c) 





(i) {{a, a), (a, b), (b, a), (b, b), (c, c)) 

(ii) {(a, a), (a, c), (c, a), (c, b), (c, c)] 



(i) R^ = R 



(ii) = 



n \ o\ 

1 1 

.001, 

1 1 M 


V 1 1 1 , 





(b) r is not reflexive, not symmetric, not antisymmetric, and not transitive. 



(c) 



19. 



(' 1 1 1 
10 
10 
I 1 1 1 J 

(a) A5 is friendly to no one. 

(b) The U.S. Ambassador (Ai) should be the chairman of this committee, since he is friendly toward the most people. 

(c) The U.S. Ambassador can communicate to everyone on the committee. 



CHAPTER 7 

Section 7.1 

1. (a) Yes (b)Yes (c) No (d) No (e) Yes 
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3. (a) Range of/ = /(A) = |a, 6, c,t/) = B 

(b) Range of g = giA) = {a, h, d] 

(c) RangeofL = L(A) = |l) 

5. For each of the | A | elements of A, there are \ B \ possible images, so there are | B | • \ B \ • . . . • | B | = | B functions from A into B. 
Section 7.2 

1 . The only one-to-one function and the only onto function is /. 

3. (a) onto but not one-to-one (/i(0) = 

(b) one-to-one and onto 

(c) one-to-one but not onto 

(d) onto but not one-to-one 

(e) one-to-one but not onto 

(f) one-to-one but not onto 

5. Let X = {socks selected) and Y = (pairs of socks) and define f : X ^ Y where f{x) =the pair of socks that x belongs to . By the Pigeonhole 
principle, there exist two socks that were selected from the same pair. 

7. (a) f{n) = «, for example 

(b) f{n) = 1 , for example 

(c) None exist. 

(d) None exist. 

9. (a) Use i : ^ P defined by ^(x) = x+l. 

(b) Use the function/ : -> Z defined by /(xO = x/2 if x is even and f(x) = -(x + l)/2 if x is odd. 

(c) The proof is due to Georg Cantor (1845-1918), and involves listing the rationals through a definite procedure so that none are omitted and 
duplications are avoided. In the first row list all nonnegative rationals with denominator 1, in the second all nonnegative rationals with denomina- 
tor 2, etc. In this listing, of course, there are duplications, for example, 0/ 1 = 0/2 = 0, 1 / 1 = 3/3 = 1 , 6/4 = 9/6 = 3/2, etc. To obtain a list 
without duplications follow the arrows in the given array listing only the circled numbers. 



We obtain: 0, 1, 1 /2, 2, 3, 1 /3, 1 /4, 2/3, 3/2, 4/ 1, ... Each nonnegative rational appears in this list exactly once. We now must 
insert in this list the negative rationals, and follow the same scheme to obtain: 0, 1, -1, 1/2, -1/2, 2, -2, 3, -3, 1/3, -1/3, ... , which can 
be paired off with the elements of f*;! . 

11. Let / be any function from A into B. By the Pigeonhole principle with n= I, there exists an element of B that is the image of at least two 
elements of A. Therefore, / is not an injection. 

13. The proof is indirect and follows a technique called the Cantor diagonal process. Assume to the contrary that the set is countable, then the 
elements can be listed: 




(W6 • - • 




«!, 112, «3, ... where each n; is an infinite sequence of Os and Is. Consider the array: 



n, = nu n,2 "13 ••• 

"2 = "21 "22 "23 • ■ ■ 
"3 = 1131 "32 "33 ■ ■ ■ 



We assume that this array contains all infinite sequences of Os and Is. Consider the sequence s defined by 
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_10 ifnii = l 
ifnii = 

s differs from each n,- in the ;th position and so cannot be in the Ust. This is a contradiction, which completes our proof. 

Section 7.3 

f+ iffe=lorfe = 5 

1 . (a) gof:A^Ci& defined by (gof) (k) = i _ ^ ^ 2 3 4 

(b) No, since the domain of /' is not equal to the codomain of g. 

(c) No, since / is not surjective. 

(d) No, since g is not injective. 

3. (a) The permutations of A are ri, ri, j\ , j^, and ^3, defined in section 15.3 



(b, c) Inverse of Square of 

Permutation the permutation permutation 

; 

r\ r2 n 

ri ri n 

fi fi i 

/2 fi i 

fi fi i 



(d) Apply both Theorems 7.3.3 and 7.3.4; If /' and g are permutations of A, then they are both 
injections and their composition, /o^, is a injection, by Theorem 7.3.3. By 1 3A,f°g is also a 
surjection; therefore, /°g is a bijection on A, a permutation. 

(e) Proof by induction: 

Basis: (n = 1) The number of permutations of A is one, the identity function, and 1 ! = 1 . 

Induction: Assume that the number of permutations on a set with n elements, n > 1, is n!. Furthermore, assume that | A | = n + 1 
and that A contains an element called x. Let A' = A-[x]. We can reduce the definition of a permutation, /, on A to two steps. First, we select 
any one of the n! permutations on A'. (Note the use of the induction hypothesis.) Call it g. This permutation almost completely defines a 
permutation on A by /(a) = g{a) for all a in A', Next, we select the image of x, which can be done n + 1 different ways. To keep our function 
bijective, we must adjust / as follows: If we select fix) = y, then we must find the element, z, of A such that g{z) = y, and redefine the image of 
z to f(z) = X. If we had selected f(x) = x, then there is really no adjustment needed. By the rule of products, the number of ways that we can 
define / is n! (n + 1) = (n + 1)! ■ 

7.(a)/og(n) = n + 3 (b) /^(n) = n + 15 (c) foh(n) = + 5 

9. Tlieorem: if / : A -» S and / has an inverse, then that inverse is unique. 

Proof: Suppose that g and h are both inverses of /. 

8 = 8°'Ag 
= g°{f°h) 
= {.g°f)°h 
= iA°h 

= h ^ g = h # 



11. Proof of Theorem 7.3.2: Let x, jt' be elements of A such that go/W = g°f{x')\ that is, g(f{x)) = gijix')). Since g is injective, f(x) = f{x') 
and since / is injective, x = x'.m 

Proof of Theorem 7.3.3: Let be an element of C. We must show that there exists an element of A whose image under g°f is x. Since g is 
surjective, there exists an element of B, y, such that g(y) = x. Also, since / is a surjection, there exists an element of A, z, such that f(z) = y, 
g°f(z)=g(f(z)) = g(y) = x.m 

13. Basis: (n = 2): (/i 0/2)-! = /2-1 o/j-2 by exercise 10. 
Induction : Assume n > 2 and C/l 0/2 o . . . o/„)-i = 

/„-io... 0/2-1 o/r' 

Consider (/i °/2 ° • • • °/n+i)"^ 
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(/i °/2 ° • • ■ °/„+i)-' = ((/i °/2 ° • • • °/„+ir' 

by the Basis = /„+ T ' = (/i = ./i ° • ■ ■ ° ' 

by Induction hypothesis = y„+r' °[f„'' o ... 0^2"' °/r') 

=/n+r'°-°/2-'°/r'." 

15. Assume all functions are fiinctions on A. 

if°8)°h = f°ig°h) 
f°iA = iA°.f = f 

If/"^ andg"' exist, {g°f)~^ = f'^°S'^ and 
If/-' exists, (f-'y'=f. 

Supplementary Exercises— Chapter 7 

I. (a) Z (b) Z (c)/(-5) = 2|5|+l = ll 

(d) {1, 3, 5, 7, 9, . . .} = the set of odd positive integers 

(e) No, a = 5 ora = -5. 

3. No. Relations (iii) and (iv) are not functions because the domain is not all of the set A. The others are not functions since in each case at least 
one element of A is mapped to 2 different elements. Example for relation (i), a is mapped to both a and b. 

5. (a) The matrix of / can only have one 1 in each row. So if the domain of / has n elements the matrix of / wiU have n Is. 
(b) If / is a bijection, besides having only one 1 in each row, there can only be one 1 in each column. 

7. (a) Let /(n) = n ^ , Vnel^. Since / is a bijection from 1^ into A = {n ^ | n e Rl), 1^ and A have the same cardinaUty; so A is countable. 

(b) Let B = {l/n|neP}.g:I^^B defined by gin) = 1 /(n + 1) is the required bijection. 

(c) ThatC = Ci U C2 = {3, 9, 27, 81, ...)U{2,4,8, 16, ...)is countable follows from the proof of Exercise 8. Without using this proof, we 
can still prove that C is countable by using the list 2' , 3' , 2^, 3^, 2^, 3^ to define /j : -> C where h(a) =the number in position a + 1 in the list, 

9./ : AxB -»BxA defined by /(a, b) = (Jb, a) is a bijection, which is all that we need to prove that 

|AxB| =|BxA| 

I I . This "code" can be viewed as a function, a, on the set of all finite sequences of letters. For example, a(hat) = qmh. This encoding function 
will not work very well because it is not a bijection. For example, no sequence with a or f in it is in the range. Although a is not one-to-one, it is 
difficult to find two English words with the same image. 

13. (a) 10 (a -I- 10) (b)a + 20 (c) 10a djv 10 = a 

(d) (a + 10) div 10 = a div 10 + 1 
15.(a)/(i)=fcand/(c) = c 

{h)f{b) = amAf{c)=d 
la b\ 

17. Since det\ ^ ^ \ =ad- be there are four permutations of {a, b, c, d] that leave the determinant invariant. These permutations are the 

identity function, ai = {(a, d), (b, b), (c, c), (d, a)}, 02 = {(a, d), (b, c), (c, b), (d, a)}, and = {(a, a), (b, c), (c, b), (d, d)}. 
19. (a) Domain = positive real numbers, Codomain = Real numbers. 

CHAPTER 8 

Section 8.1 

l.C(5, 2)= C(4, 2) + C(4, 1) 

= C(3, 2) + C(3, 1) + C(3, 1) + C(3, 0) 

= C(3, 2) + 2C(3, 1) + 1 

= C(2, 2) + C(2, 1) + 2 (C(2, 1) + C(2, 0)) + 1 

= 3C(2, 1) + 4 

= 6 + 4= 10 

3. (a) p (x) in telescoping form: ((((x + 3) x - 15) x + 0) x + 1) x - 10 
(b) p(3) = ((((3 -H 3) 3 - 15) 3 - 0) 3 + 1) 3 - 10 = 74 
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5. The basis is not reached in a finite number of steps if you try to compute f(x) for a nonzero value of x. 
Section 8.2 

1. Basis: B(0) = 3 - + 2 = 2, as defined 

Induction: Assume: B{k) = 3 yfe + 2 for some A: > 0. 
B(k + 1) = B(k) + 3 

= (3k + 2) + 3 by the induction hypothesis 

= (3k + 3) + 2 

= 3 + 1) + 2, as desired. ■ 

3. Imagine drawing line k in one of the infinite regions that it passes through. That infinite region is divided into two infinite regions by line k. 
As line k is drawn through every one of the k - 1 previous lines, you enter another region that line k divides. Therefore, the number of regions 
is increased by k. 

5. For n greater than zero, M(n) = M{n - 1) + 1, and M(0) = 0. 

Section 8.3 

1. 5(/t) = 2 + 9* 

3. 5(/t) = 6(l/4)* 

5. S(k) = k^ - \0k + 25 

I. Sik) = i3 + k)5'' 

9. S(k) = (12 + 3k) + (k^ + 7 k- 22)2''-^ 

II. P()t) = 4(-3)* + 2*-5*+i 

13. (a) The characteristic equation isaa^-a-l=0, which has solutions a = (l + Vs") /2 and yS = (l - Vs") / 2, It is useful to point out that 
a + fi = I and a - 13 = Vs". The general solution is 

F{k) = biO^ + b2 0' . 

Using the initial conditions, we obtain the system: bi + b2 = l and bi a + b2 = 1. The solution to this system is 

fei = a/(a - j8) = (5 + V5")/2 V5" 
and 62 = y3/(ff-yS) = (5- V5")/2V5" 

Therefore the final solution is 

*vr)/.r'-((,-v?)/2)-] 

(h)Cr = F(,r+l) 
15. (a) D(n) = 2D(n - 1) + 1 forn > 2 andD(l) = 0. 
(b) D(n) = 2"-i - 1 

17. Solutions to the recurrence relation and its approximation are B(k) = (1 + c)* + (1 - c)* and Ba(k) = 1. Note how as k increases, B(^) grows 
in size, while Ba(k) stays constant. 

Section 8.4 

l.(a) S(n) = l/n! (c)?7(A:) = 1 /A:, an improvement. 

(b) T(k) = (-3)* k\ , no improvement. 

3 . (a) Tin) = 3 ([log^ nJ + D (c) V(n) = [logg nj + 1 
(b) T(n) = 2 
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5. The indicated substitution yields S(n) = S(n + 1). Since S(0) = r(l)/r(0) = 6, S(n) = 6 for aU n. Therefore T(n + 1) = 6 T(n) ^ T(n) = 6". 

7. (a) A good approximation to the solution of this recurrence relation is based on the following observation: n is a power of a power of two; 
that is, n is 2", where m = 2* , then Q(n) = 1 + Q(2"/^). By applying this recurrence relation k times we obtain Q(n) = k. Going back to the 

original form of n, logjW = 2* or logjClogjW) = k. We would expect that in general, Q(n) is [logjClogj n)J. We do not see any elementary 
method for arriving at an exact solution. 

(b) Suppose that n is a positive integer with 2*"' < n < 2*. Then n can be written in binary form, (a^_i a^_2 • • • 02 ai ao)two with a^_i = 1 and 
R(n) is equal to the sum 

k-i 

!=0 

If 2*"' < n < 2*, then we can estimate this sum to be between 2 n - 1 mdln + 1. Therefore, i?(n) « 2n. 
Section 8.5 

l.(a) 1,0,0,0,0,... (b) 5(1/2)* (c) 1, 1, 0, 0, 0, ... 

(d) 3 (-2)* + 3-3* 
3. (a) l/(l-9z) (b)(2-10z)/(l-6z + 5z2) 

(C) 1/(1 -z-z2) 

5. (a) 3/(1 -2z) + 2/(l +2z), 3-2* + 2(-2)* 

(b) 10/(1 -z) + 12/(2 -z), 10 + 6(1/2)*= 

(c) -l/(l-5z) + 7/(l-6z), 7-6*-5* 
7. (a) Uk 

(b) {5/3)k(k + l)(2k + l) + 5k{k + 1) 

k k k 

(c) E (i) (10 (A: - j)) = 10^27-102 / 

;=0 ;=0 .,=0 

= 5k^{k + l)-{5k(k+l){2k+l)l6) 
= {5l3)k{k+l){2k+l) 

(d) it(fe+ l)(2it + 7)/12 

9. Coefficients of z" through z' in (1 + 5 z) (2 + 4 z) (3 + 3 z) (4 + 2 z) (5 + z) 



k Number of ways of getting a score of k 

120 

1 1044 

2 2724 

3 2724 

4 1044 

5 120 



Supplementary Exercises— Chapter 8 

1. Let v{n) be the quantity in question. Since any positive digit can appear in a one-digit positive integer, v(l) = 9. Given an n digit number, 
n > 2, it can be thought of as an n - 1 digit number times ten plus a digit. This digit caimot be the same as the units digit of the n - 1 digit 
number. Therefore, by the product rule v(n) = 9 v(n - 1) forn > 2. 

3. (a) To execute Split with Lin = (1, 2, 3, 4), we must split the list into L 1 = (1, 3) and L2 = (2, 4). If you carefully examine the algorithm for 
a list of length 2, you will see that the output equals the input; therefore L 1 out = (1, 3) and L 2 out = (2, 4) and L out = (1,3, 2, 4). 
(b) Examine the results for r = 1, 2, 3 with numbers in binary form. Notice the symmetry with respect to the vertical hne. 
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L in L out 



1 1 

L in L out 

00 00 

r = 2 01 10 

10 01 

11 11 



L in L out 

000 000 

001 100 
010 010 

r = 3 Oil 110 

100 001 

101 101 

110 Oil 

111 111 



The integers in Lout are sorted so that ^^-2 ••• ^o)two appears in position (&o ■■■ ^r-i)two- 

5. This is not a closed form expression because the number of operations that are needed to compute the expression grows with n, B (n) in this 
form requires n additions and n - I multiplications. 

7 . Kathryn' s balance on her first birthday is$l = B(l).IfB(n)is her balance on her «th birthday ,n>2, then B{n) = I A B(n - I) + n. 

B(n) = B^"\n) + B'-P\n) = foi(l .1)" - (10 n + 1 10) 
B(l) = 1 => (l.l)fo, = 121 ^bi = 1.1 

Therefore = 121(1.1)"-' -(10n+ 110). On her 21st birthday, Kathryn will haveB(21)= 121 (1.1)20 - (210 + 1 10) = $494 .03. 

9. (a) If it takes X (n) moves to move n disks to peg 2, then we can transfer the « + 1 disk to peg 3 in one move and then transfer the n disks 
from peg 2 to peg 3 in X(n) moves, so X(n + 1) = X(n) + 1 + X(n) = 2 X(n) + 1 , or equivalently X(n) = 2 X(n - 1) + 1 . 

(b) X(n) = bi -2" - 1. Since it takes 1 move to transfer 1 disk from one peg to another, ^(1) = 1 ; so foi = 1 and X(n) = 2" - 1. We verify that 
X(3) = 7: 



Start; 1 












^(2) 












2 






2 
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3 
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2 
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-»(*) 






-(7) 








I 














2 






2 
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3 
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2 


J 


] 




3 






3 



11. The solution for n = 4* is Q(4*) = - (4*+' - l). This can be obtained in one of two ways. Either use the substitution S{k) = Q(4*), which 
yields S(k) = 4* + 5(/(: - 1), or note that 2(4*) = 4* + 2(4*^-1) = 4*^ + 4'^-' + 2(4*-^) = 4* + 4*-i + . . . + 4 + i . This finite geometric series has the 
closed form expression above. By similar means, 2(2-4*) = 2 2(4*) = ^ (4**' - l) 
13.G(5; z)=l+z + 2z^+4z^ + Sz*--- 

15. GiT;z) = G{S;cz) = Ii S(k)(czf = 

= Ii(S(k)c'')z'' 
Therefore, 7'(/t) = 5(/(:) cK 
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I CHAPTER 9 

Section 9.1 

1. In Figure 9.1.2, computer h can communicate with all other computers. In Figure 9.1.3, there ai'e roads to and from city b to all other cities. 
In Figure 9.1 .4, there is a door connecting room b to every other room in the house (including the outside). 

3. 



' Start I 




5. No, the maximum number of edges would be = 28. 
7. (a) C(n, 2) = 

(b) n - I, one edge for each vertex except the champion vertex. 
Section 9.2 
A Exercises 

1 . Estimate the number of vertices and edges in each of the following graphs. Would the graph be considered sparse? 

(a) A rough estimate of the number of vertices in the "world airline graph" would be the number of cities with population greater than or equal 
to 100,000. Using the Wolfram CityData function we can get this number: 

CityData[All] // Select [n, CityData [B, "Population"] >= 10^5 &] & // Length 

4257 

There are many smaller cities that have airports, but some of the metropolitan areas with clusters of large cities are served by only a few 
airports. 4000-5000 is probably a good guess. As for edges, that's a bit more difficult to estimate. It's certainly not a complete graph. 
Looking at some medium sized airports such as Manchester, NH, the average number of cities that you can go to directly is in the 50-100 range. 
So a very rough estimate would be '^^ - ^^oo _ jgg -^^q 

(b) The number of ASCII characters is 128. Each character would be connected to 8 others and so there are ^ =512 edges. 

(c) The Oxford English Dictionary as approximately a half-million words, although many are obsolete. Mathematica has a words database 
that is a bit less comprehensive, yet it isn't trivial. The nice thing about using the Wolfram data is that the number of vertices and edges can be 
counted. Here is the number of words and hence the number of vertices in the graph: 

WordData [All] //Length 

149191 

And here are the number of edges, determined in a not necessarily efficient manner. 

(Length [WordData [B ~~ , "Lookup"]] & /©WordData [All] ) // Apply [Plus, B] & 

876547 

The last output took about three hours using a MacBook Pro with a 2.5 GHz Intel Core 2 Duo Processor. It should also be pointed out that 
Wolfram's choice of "words" doesn't match the OED or a Scrabble player's dictionary. For example, "1900s" and "18-karat gold" are 
included among the list of words. Nevertheless, the number we have here are good ballpark estimates for Most interpretations of the "English 
words." 

3. Each graph is isomorphic to itself. In addition, G2 and G4 are isomorphic; and G3, G5, and Gg are isomorphic. 
Section 9.3 
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k 1 2 3 4 5 6 

, V[A:].found TTTFFT ... 
1- T/r/ifc o c <c ,(* = undefined) 

V[K].fTom 2 5 6 **5 

Depth Set 2 12**1 

3. If the number of vertices is n, there can be ''J" ^' vertices with one vertex not connected to any of the others. One more edge and cormectiv- 
ity is assured. 

5. Basis: = 1) Is the relation r'defined by vr' w if there is a path of length / from v to w? Yes, since vrw if and only if an edge, which is a path 
of length /, cormects v to w. 

Induction: Assume that vr* w if and only if there is a path of length k from v to w. We must show that vr**' w if and only if there is a path of 
length k + I from v to w. 

vr*"^' w => vr* y and yrw, for some vertex y. By the induction hypothesis, there is a path of length k from v to y. And by the basis, there is 
a path of length one from y to w. If we combine these two paths, we obtain a path of length k + 1 from v to w. Of course, if we start with a path 
of length fc + 1 from v to w, we have a path of length k from v to some vertex y and a path of length / from y to w. Therefore, 
vr* y and yrw => vr*"*"' w m 

Section 9.4 

1. Using a recent road map, it appears that a Eulerian circuit exists in New York City, not including the small islands that belong to the city. 
Lowell, Massachusetts, is located at the confluence of the Merrimack and Concord rivers and has several canals flowing through it. No Eulerian 

path exists for Lowell. 

3. Gray Code for the 4-cube: 



G4 = 



' 0000 ■ 
0001 
0011 
0010 
0110 
0111 
0101 
0100 
1100 
1101 

nil 

1110 

1010 
1011 
1001 

, 1000 , 

5. Any bridge between two land masses will be sufficient. To get a Eulerian circuit, you must add a second bridge that connects the two land 
masses that were not coimected by the first bridge. 

7. Theorem: Let G =(V, E)be a directed graph, C has a Eulerian circuit if (a) G is connected and (h) indeg(v) = outdeg(v) /or all v in V. 
There exists a Eulerian pathfrom Vj tov2 if (a) g is connected and (h) indeg(vi) = outdeg(vi) - 1, indeg(v2) = outdeg(v2) + 1 and for all other 
vertices in V the indegree and outdegree are equal. 

9. A round-robin tournament graph is rarely Eulerian. It wiU be Eulerian if it has an odd number of vertices and each vertex (team) wins exactly 
as many times as it loses. Every round-robin tournament graph has a Hanultonian path. This can be proven by induction on the number of 
vertices. 



Section 9.5 

1. The circuit would be Boston, Providence, Hartford, Concord, Montpelier, Augusta, Boston. It does matter where you start. If you start in 
Concord, for example, your mileage wiU be higher. 

3. (a) Optimal cost =2^ . 

Phase 1 cost = 2.4 -[l . 

Phase 2 cost =2.6^^. 
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(b) Optimal cost =2.60. 
Phase 1 cost =3.00. 

Phase 2 cost 2 V2". 

(c) A = (0.0, 0.5), B = (0.5, 0.0), C = (0.5, 1.0), D = (1.0, 0.5) 
There are 4 points; so we will divide the unit square into two strips. 

Optimal Path: (B, A, C, D) Distance = 2 -yT 

Phase I Path: (B, A, C, D) Distance = 2 yfT 

Phase 11 Path: (A, C, B, D) Distance = 2 + V2~ 

(d) A = (0, 0), B = (0.2, 0.6), C = (0.4, 0.1), D = (0.6, 0.8), E = (0.7, 0.5) 

There are 5 points; so we will divide the unit square into three strips. 

Optimal Path: (A, B, D, E, C) Distance = 2.31 

Phase I Path: (A, C, B, C, E) Distance = 2.57 

Phase n Path: (A, B, D, E, C) Distance = 2.31 

5. (a) f(c, d) = 2, fib, d) = 2, fid, k) = 5, fia, g) = l, and fig, k) = l. 

(b) There are three possible flow-augmenting paths. 

s, b, d, k with flow increase of 1. 
s, a, d, k with flow increase of 1, and 
s, a, g, k with flow increase of 2. 

(c) The new flow is never maximal, since another flow-augmenting path will always exist. For example, if s, b, d, k is used above, the new 
flow can be augmented by 2 units with s, a, g, k. 

1 . (a) Value of maximal flow =31. 
(b) Value of maximal flow =14. 



(c) Value of maximal flow = 


14. 


One way of obtaining this flow is: 




Step Flow - Augmenting Path 


Flow Added 


1 Source, A, Sink 


2 


2 Source, C, B, Sink 


3 


3 Source, E, D, Sink 


4 


4 Source, A, B, Sink 


1 


5 Source, C, D, Sink 


2 


6 Source, A, B, C, D, Sink 


2 



9. To locate the closest neighbor among the list of k other points on the unit square requires a time proportional to k. Therefore the time required 
for the closest-neighbor algorithm with n points is proportional to (n - 1) + (n - 2) + • • • + 2 + 1, which is proportional to n^. Since the strip 
algorithm takes a time proportional to n (log n), it is much faster for large values of n. 

11. LetP = Pi, P2, .... Pin be a set of points in the unit square. If S is a subset of P, define min(S) to be the point in S with smallest x coordi- 
nate. If there is a tie, select the point with smallest y coordinate. 

Matching Algorithm: 

1. S:=P 

2. While S + Do 

2.1 v:=min(5) 

2.2 w := closest point to V in 5 - (v) 

2.3 pair up V and w 
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2.4 S:=S-{v, w] 

Although this could be classified as a closest-neighbor algorithm, there is a better one, but it is more time-consuming. 
Section 9.6 

1 . Theorem 9.6.2 can be applied to infer that if « & 5 , then K„ is nonplanar. A K4. is the largest complete planar graph. 
3. (a) 3(b) 3 (c) 3 (d) 3 (e) 2 (f) 4 

5.n 

7. Suppose that G' is not connected. Then G' is made up of 2 components that are planar graphs with less than k edges, Gi and G2. For ( = 1 and 

2. let V;, r,, and ej be the number of vertices, regions and edges in G,-. 

By the induction hypothesis: 

vi + ri - ei = 2 
and V2 + r2 - 62 = 2 

One of the regions, the infinite one, is common to both graphs. Therefore, when we add edge e back to the graph, we have 
r = ri + r2 - I, v = Vi + V2, axide = ei + €2 + I. 

V + r - e + (vi + V2) + (rj + r2 - I) - (^i + 62 + I) 
= (vi + i-i - ei) + (V2 4- rz - 62) - 2 
= 2 -F 2 -2 
= 2 ■ 

9. Since \E\ 4- = " , either EorE^ has at least " elements. Assume that it is E that is larger. Since is greater than 

3«-6forn&ll,G would be nonplanar. Of course, if is larger, then G' would be nonplanar by the same reasoning. 

11. Suppose that (V, E) is bipartite (with colors red and blue), \E\ is odd, and (vi, V2, V2„+i, vi) is a Hamiltonian circuit. If vj is red, then 
V2n+i would also be red. But then (v2„+i, vil would not be in E, a contradiction. 

13. Draw a graph with one vertex for each edge. If two edges in the original graph meet at the same vertex, then draw an edge connecting the 
corresponding vertices in the new graph. 

Supplementary Exercises— Chapter 9 

1. Graphs G\ and G2 are isomorphic. One isomorphism between them is {(a, e), (b, h), (c, /), (d, g)|. To see that G3 is not isomorphic to the 
other two notice that k and j are not connected by an edge while in Gi and G2 every pair of vertices is connected. 

3. (a) {a, e] is a maximal independent set in Figure 9.1.2. 

(b) (By contradiction) Assume that W is a maximal independent set in G. If V is not connected to any vertex, U |v) is independent, and since 
this is a larger set, Wis not maximal. 

(c) A single vertex is maximal; no larger set can be independent. 
5. (a) 
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(b) (Mexico, Guatemala, Belize, Nicaragua, Costa Rica, Panama) 

(c) This path could be a list of the countries that you would go through in your trip. 

7. (a) If one source s exists, then {s, v) is on the edge of the round-robin tournament graph for each vertex v different from s. Therefore no other 
vertex could be a source. By similar reasoning, only one sink can exist. In a round-robin tournament, only one team can be unbeaten and only 
one can be winless. 

(b) If |V| = n, outdeg (iomce) = indeg {sink) = n - I 

(c) Let V = {vi , V2, . . . , v„). The following graph demonstrates that p /\^q is possible. Similar graphs can be drawn for the other situations. 




k 


1 


2 


3 


4 


5 


6 


7 


8 


9 


V [A:] .name 


a 


b 


c 


d 


e 


/ 


8 


h 


i 


VM. found 


T 


T 


T 


T 


T 


T 


T 


T 


T 


VM.from 


4 


1 


5 


5 


1 


3 


5 


5 


6 


depth set 


3 


1 


2 


2 


1 


3 


2 


2 


4 



1 1 . Gi is randomly Eulerian from no vertex, yet it is Eulerian. 

G2 is randomly Eulerian from only vertex 1 . 

G3 is randomly Eulerian from only vertices 1 and 2. 

G4 is randomly Eulerian from every vertex. 
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Gi 

2 3 





G4 



13. Addition of edges to E will certainly not decrease the degrees of each vertex. After adding some edges to E until no more can be added 
without allowing a Hamiltonian circuit, select e = [v], v„| not in the new, larger E. Since a Hamiltonian circuit exists in (G, E [j there is a 
path in G that visits every vertex in the order vi , V2, • • .v„. Now for 2 < / < «, if |vi , v,j e E, then 



Not both of the thicker red edges can be in E. 




|v,_i, v„| ^ E, for Otherwise, (vi, V2, 



., V,, vi) is a Hamiltonian circuit. 



Since {vi , v,} e i? => (v;_i , v„) ^ £ <ss> -> (|vi , v,j e E and {v,_i , v„) eE),no more than « - 1 of the possible edges that connect vi and v„ to other 
vertices could be in E, even after adding edges to E. Therefore, for the original graph, with |vi , v„} fE, deg v\ + deg v„ <n,a. contradiction. 

15. (a) fib, d) = f(c, d) = f(a, g) = fiy, t)=l, f(d, t) = 2, ViJ) = 3. 

(b) One flow-augmenting path is {s, a, g, t), which increased the flow value by 1, to 4. (A second one is (s, b, d, a, g, t).) 

(c) The new flow is maximal since its value is equal to the sum of capacities into the sink. 
17.(a) (A, D, F, E, C, B, A) 

(b) Starting at any city, it would take n -2 seconds to decide where to go first. Then it would take n - 3 seconds from the next step, and so on. 
The total time would be 



(n - 2) + (« - 3) + ■ ■ ■ + 2 + 1 + : 



1 



2(/i-2) (n-1) 



- seconds 



■ seconds, when n is large. 
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19. 




I CHAPTER 10 



Section 10.1 

1 . The number of trees are: (a) 1 , (b) 3, and (c) 16. The trees that connect are: 



E W E W 



E W E W 



□ 



E W E W 




E W E W 



s s s 

3. Hint: Use induction on \E\. 

5. (a) Assume that (V, E) is a tree with |V| > 2, and all but possibly one vertex in Vhas degree two or more. 
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2\E\ = Y.deg{v)>2\V\ -1 
ve V 

or \E\>\V\- '-^\E\>\V\^ (V, £) is not a tree. 

(b) The proof of this part is similar to part a in that we get 2 > 2 | V| - 1 , since a tree that is not a chain has a vertex with degree three or more. 
Section 10.2 

1 . It might not be most economical with respect to Objective 1 . You should be able to find an example to illustrate this claim. The new system 
can always be made most economical with respect to Objective 2 if the old system were designed with that objective in mind. 

3. In the figure below, |1, 2) is not a minimal bridge between L = {1, 4) and/? = |2, 3), but it is part of the minimal spanning tree for this graph. 




5, (a) Edges in one solution are: 

|8, 7|, |8, 9|, (8, 13),(7,6|, |9,41,|13, 12), (13, 14), |6, 11), {6, 1), (1, 2), {4, 3), {4, 5) {14, 15), and {5, 10) 

(b) Vertices 8 and 9 are at the center of the graph. Starting from vertex 8, a minimum diameter spanning tree is 
{|8, 3), {8, 7), {8, 13), {8, 14), {8, 9), |3, 2), (3, 4), (7, 6), (13, 12), (13, 19), (14, 15), (9, 16), 

(9, 10), (6, 1), (12, 18), (16, 20), (16, 17), (10, 11), (20, 21), (11, 5)). The diameter of the tree is 7. 

Section 10.3 

1. Locate any simple path of length d and locate the vertex in position rii/2]on the path. The tree rooted at that vertex will have a depth of 
[d/ll, which is minimal. 

3. 
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7. Solution #1: 

Basis: A binary tree consisting of a single vertex, which is a leaf, satisfies the equation leaves = internal vertices + 1 , 

Induction: Assume that for some > 1 , all full binary trees with k or fewer vertices have one more leaf than internal vertices. Now consider 
any full binary tree with k + 1 vertices. Let Ta and Tb be the left and right subtrees of the tree which, by the definition of a full binary tree, must 
both be full. If /^andifl are the numbers of internal vertices in TA&ndTB, and j^andjs are the numbers of leaves, then 
Ja = i'a + 1 and js = i's + 1 . Therefore, in the whole tree, the number of leaves = Ja + js 

= dA + 1) + (h + 1) 

= (iA + IB + 1) + 1 

= (number of internal vertices) + 1 

Solution #2: Imagine building a full binary tree starting with a single vertex. By continuing to add leaves in pairs so that the tree stays full, 
we can build any full binary tree. Our starting tree satisfies the condition that the number of leaves (1) is one more than the number of internal 
vertices (0). By adding a pair of leaves to a full binary tree, an old leaf becomes an internal vertex, increasing the number of internal vertices by 
one. Although we lose a leaf, the two added leaves create a net increase of one leaf. Therefore, the desired equality is maintained. 

Supplementary Exercises— Chapter 10 

1. Each of the « - 1 edges of a tree contributes to the degrees of two vertices. Therefore the sum of all degrees of vertices in an n vertex tree is 
2(n- l) = 2n-2. 

3. (a) G2 is graceful : vi = 1, V2 + 2, V3 = 4 

G4 is graceful : vi = 2, V2 = 1 , V3 = 3, V4 = 4 

(b) Starting at either end of the chain label the first vertex S(I) = 1 and the (k + 1) st vertex, k> I, S{k + I) = S(fe) + k. The edge connecting the 

kth and {k + 1) st vertex is the kth edge and since (S(k + 1) - s(k)) = k, the chain is graceful. The closed form expression for 

Sik)is I +{k{k -'-)). 

5. First, |3, 6| is added to the edge set, then |1, 2) and (3,4). Then {4, 6) is rejected since it would complete a cycle. This can be seen from the 
forest. 




Vertices 4 and 6 have the same root in this tree; hence |4, 6| is rejected. (1,5) and |2, 3) are the final edges that complete the minimal spanning 
tree. Notice that {4, 6) could have been the second edge selected. In that case, {3, 4) would be rejected. 

7. The depth of the tree is four. 
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Section 1 1 .1 

1. (a) Commutative, and associative. Notice that zero is the identity for addition, but it is not a positive integer.) 

(b) Commutative, associative, and has an identity (1) 

(c) Commutative, associative, has an identity (1), and is idempotent 

(d) Commutative, associative, and idempotent 

(e) None. Note: 2 @ (3 @ 3) = 512 

(2 @ 3) @ 3 = 64 

and while a@l=a,l@a = l. 

3. a, b & A C\ B a, b & A by the definition of intersection 

a * e A by the closure of A with respect to * 

Similarly, a, b e A C\B =^ a*b & Therefore, a * b & A P\ B. 

The set of positive integers is closed under addition, and so is the set of negative integers, but 1 H — 1 - 0. Therefore, their union, the nonzero 
integers, is not closed under addition. 

5. Let be the set of all noimegative integers (the natural numbers). 

(a) * is commutative since \a-b\ = \b - a\ for all a, e 

(b) * is not associative. Take a = l,b = 2, andc = 3, then 

(a*fo)*c = ||l-2|-3| = 2,and 
a * (6 * c) = |1 -12-311 = 0. 

(c) Zero is the identity for * on , since 

a*0 = |a + 0| = a = \0 - a\ = * a. 

(d) a"' = a for each a e f^, since 

a * a = \a - a\ =0. 

(e) * is not idempotent, since, for a + 0, 

a * a =0 i=a. 

Section 11.2 

1. The terms "generic" and "trade" for prescription drugs are analogous to "generic" and "concrete" algebraic systems. Generic aspirin, for 
example, has no name, whereas Bayer, Tylenol, Bufferin, and Anacin are all trade or specific types of aspirins. The same can be said of a 
generic group [G, *] where G is a nonempty set and * is a binary operation on G, When examples of typical domain elements can be given 
along with descriptions of how operations act on them, such as Q* or M2x2(IR), then the system is concrete (has a specific name, as with the 
aspirin). Generic is a way to describe a general algebraic system, whereas a concrete system has a name or symbols making it distinguishable 
from other systems. 

3. b, d, e, andf. 




abelian 





I 


Ri 


R2 


Fi 


F2 


Fi 


/ 


I 


Ri 


R2 


Fi 


F2 


F3 


Ri 


Ri 


R2 


I 


F2 


Fi 


Fi 


Ri 


R2 


I 


Ri 


F3 


Fi 


F2 


Fi 


Fi 


F 


F2 


I 


R2 


Ri 


F2 


F2 


Fi 


F3 


Ri 


I 


R2 


F3 


F3 


F2 


Fi 


R2 


Ri 


I 



This group is non-abeUan since, for example, Fi F2 = R2 and F2 Fi = i?2- 
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(c)4! = 24,n! 

7. The identity is e. a*b = c,a*c= b, b*c = a, and [V, *] is abelian. (This group is commonly called the Klein-4 group.) 

Section 11 .3 

1. (a) /is injective: fix) = f(y) =>a*x = a*y 

=^ X = y (by left cancellation) 

/is surjective: For all fo, f (x) = ^) has the solution a"' 

(b) Functions of the form f \x) = a + x, where a is any integer, are bijections 

3. Basis: (n = 2) (01*02)"' = a2~'*ai"' by Theorem 11.3.4. 
Induction: Assume that for some n > 2, 

(ai *a2* ••• *a„)"' = a„~^ *■■■* 02"' *ai'^ 
We must show that 

(ai *a2* ••• *a„*a„+i)"' = a„+i"' *a„~^ 02"' *ai~' 
This can be accomplished as follows: 

(ai *a2* ••• *a„*a„+i)"* = ((ai *a2 * ••• *a„)*a„+i)"' by the associative law 
= a„+i"' *(ai *a2* ••• *««)"' by the basis 
= a„+i~' *(a„~' * ••• * 02"' *ar') by the induction hypothesis 
= a„+i"' *a„"' *•••* 02 ' by the associative law ■ 

5. (a) Let p (n) be, where a is any element of group [G; *]. First we will prove that p (n) is true for all n > 0. 

First, we would need to prove a lemma that we leave to the reader, that if n > 0, and a is any group element, a*a" = a" *a. 

Basis: If n = 0, Using the definition of the zero exponent, (a") ' = e"' = e, while (a"')" = e. Therefore, p (0) is true. 
Induction: Assume that for some n > 0, p(n) is true. 

(a"*')~' = (a"*a)~' by the definition of exponentiation 
= a''*(a")"' by Theorem 11.3-4 
= a^'*(a"')" by the induction hypothesis 
= (a"')"*' by the lemma 
If n is negative, then -n is positive and 

a-- (((a-')-')-") 

= (a ' ) since the property is true for positive numbers 

= (a-')" 

(b) For m > 1, let p(m) be a"*"' = a" *a"' for all n> 1. The basis for this proof follows directly from the basis for the definition of 

exponentiation. 

Induction: Assume that for some m > 1 , p (m) is true. Then 

^n+Cm+i) _ ^(n+mj+i by the associativity of integer addition 
= a""^" * a' by the definition of exponentiation 
= {a" * a") * a' by the induction hypothesis 
= a"* (a'"* a') by associativity 
= a"* a""*' by the definition of exponentiation 

(c) Let p (m) be (a")"" = a"" for all integers n. 

Basis: (a™)" = e and a™" = a" = e therefore, p (0) is true. 
Induction; Assume that p (m) is true for some m > 0, 

(0")™+ ' = (a")"" * a" definition of exponentiation 
= a"™ * a" by the induction hypothesis 
= a""" + " by part (a) of this problem 

_ „n(m+l) 



Applied Discrete Structures by A. Doerr & K. Levasseur is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States License. 



Solutions to Odd Numbered Exercises 



459 



Finally, if m is negative, we can verify that (a")" = a""' using many of the same steps as the proof of part (a). 
Section 1 1 .4 

1. (a) 2 (b)5 (c)0 

(d) (e) 2 (f) 2 

(g) 1 (h) 3 

3. (a) 1 (b) 1 (c)m(4) = r (4), where m = II q + r,0 < r < 11. 

5. Since the solutions, if they exist, must come from Z2 , substitution is the easiest approach. 

(a) 1 is the only solution, since 1- +2 1 = and 0- +7 1 = 1 

(b) No solutions, since 0- +2 +2 1 = 1 , and 1^ +2 1 +2 1 = 1 

7. Hint: Prove by induction on m that you can divide any positive integer into m. That is, let p (m) be "For all n greater than zero, there exist 
unique integers q and r such that. . . ." In the induction step, divide n into m - n. 

Section 1 1 .5 

1 . a and c 

3. [I, Ri, R2},{t, FilAI, Fi], and |/, F3) are all the proper subgroups of R3. 
5. (a)(l> = (5) = Z6 

(2) = (4) = |2, 4, 0) 

(3) = |3, 0) 
(0) = (0| 

(b)(l> = (5) = <7) = (11) =Zi2 

(2) = (10) = |2, 4, 6, 8, 10, 0) 

(3) = (9) = |3, 6, 9, 0) 

(4) = (8) = (4, 8, 0) 
(6) = (6, 0| 

(0) = (0) 
(c) (1) = (3) = (5) = <7> = Zg 
(2) = (6) = (2, 4, 6, 0) 
(4) = (4, 0| 
(0) = (0| 
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(d) Based on the ordering diagrams in parts a through c, we would expect to see an ordering diagram similar to the one for divides on 
{1, 2, 3, 4, 6, 8, 12, 24) (the divisors of 24) if we were to examine the subgroups of Z24. This is indeed the case. 

7. Assume that and K are subgroups of group G, and that, as in Figure 11.5.1, there are elements x & H — K and y & K — H. Consider the 

product X * y. Where could it be placed in the Venn diagram? If we can prove that it must lie in the outer region, H'^ f] K'^ = (H [j Kf, then we 
have proven that H \J K i& not closed under * and can't be a subgroup of G, Assume that x*y sH. Since x is in H, x"' is in H and so by 
closure 

X"' *(x * y) = y e H 
which is a contradiction. Similarly, x*>' fK. ■ 

One way to interpret this theorem is that no group is the union of two groups. 

Section 11.6 

1. Table of Z2X Z3 : 

y 





0, 


0, 1 


0, 2 


1 , 


1 , 1 


1, 2) 


{0, 0} 


{0, 0} 


{0, 1} 


(0, 2) 


(1, 0) 


(1, 1) 


{1, 2) 


{0, 1} 


{0, 1} 


{0, 2} 


{0, 0) 


{1, 1) 


(1, 2) 


{1, 0) 


{0, 2} 


{0, 2} 


{0, 0} 


{0, 1) 


{1, 2) 


{1, 0) 


{1, 1) 


{1, 0} 


{1, 0} 


{1/ 1) 


{1, 2) 


{0, 0) 


{0, 1) 


{0, 2} 


{1, 1} 


{1, 1} 


{1, 2} 


(1, 0) 


(0, 1) 


{0, 2) 


{0, 0) 


{1, 2} 


{1, 2} 


{1, 0} 


{1- 1) 


{0, 2) 


{0, 0) 


{0, 1) 



The only two proper subgroups are {(0, 0), (1, 0)} and {(0, 0), (0, 1), (0, 2)} 
3. (a) (i)a + b couldbe (1, 0)or(0, 1). 
(ii) a + b = (1, 1). 

(b) (i)a + b = couldbe (1, 0, 0), (0, 1, 0), or (0, 0, 1). 
(ii)a + b = (1, 1, 1). 

(c) (i) a + b has exactly one 1. 
(ii) a + b has all 1 ' j. 

5. (a) No, is not an element of Z xZ. 

(b) Yes. 

(c) No, (0, 0) is not an element of this set. 

(d) No, the set is not closed: (1, 1) + (2, 4) = (3, 5) and (3, 5) is not in the set. 

(e) Yes. 

Section 1 1 .7 

1. (a) Yes,/(n, x) = (_x, n)for(n, x) e Z x R is an isomorphism. 

(b) No, Z2 X Z has a finite proper subgroup while Z x Z does not. 

(c) No. 

(d) Yes. 

(e) No. 



(f) Yes, one isomorphism is defined by /(ai, 02, 03, 04) = 
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(g) Yes, one isomorphism is defined by /(ai, 02) = (ai, 10"^). 

(h) Yes. 

©Yes f{k)=k{l, 1). 

3 . Consider 3 groups Ci , C2 , and C3 with operations * , <> , and □, respectively.. We want to show that if Gi is isomorphic to G2 , and if G2 is 
isomorphic to G3 , then Gi is isomorphic to G3. 

Gi isomorphic to G2 => there exists an isomorphism / : Gi -» G2 

G2 isomorphic to G3 => there exists an isomorphism g : G2 -» G3 

If we compose g with/, we get the function go/ : Gi -» G3, By Theorems 7.3.2 and 7.3.3, g°/ is abijection, and if a, e Gi, 

{g''f)(.a*b) = gifia*h)) 

= gif(a)of{h)) since / is an isomorphism 
= g(f(a)) ag(f(b)) since g is an isomorphism 

= fe°/)(fl)*te°/)W 

Therefore, go/ is an isomorphism from G\ into G3 , proving that "is isomorphic to" is transitive. 

5. Zg, Z2 X Z4 , and Z2^l. One other is the fourth dihedral group, introduced in Section 15.3. 

7. Let G be an infinite cyclic group generated by a. Then, using multiplicative notation, G = {a" | n e Z}. 

The map T : G — > J. defined by T{a") = n is an isomorphism. This is indeed a function, since a" = a" implies n =m. Otherwise, a would 
have a finite order and would not generate G. 

(a) T is one-to-one, since T(a") = T(a"') implies n = m, so a" = a". 

(b) T is onto, since for any n&J., T(a") = n. 

(c) T(a''*a"') = Tia"*"') 

= n + m 

= T(a") + Tia"') 

Supplementary Exercises— Chapter 11 

1 . (a) With respect to V under +, the identity is a; and -a = a, -b = c, and -c = b. 

(b) With respect to V under ■, the identity is b. Inverses: fc"' = b, c"' = c, and a has no inverse, 

(c) • is distributive over + since x ■ (y + z) = x -y + x- zfoT each of the 27 ways that the variables x, y, and z can be assigned values from V. 
However, -1- is not distributive over • since b + (a -c) = b, while (b + a) ■ (b + c) = a, 

3. By Theorem 7.3.4 every bijection has an inverse, so = has the inverse property on S. If / e S, 

fof-i=f-iof = i ^/inverts/-', or (/-!)-' = /. 
Therefore, inversion of functions has the involution property. 

5. If a and b are odd integers, a = 2 j + \ and b = 2k + I for j, k & J., ab = j + l){2k + I) = 2(2 j k + j + k) + \, which is an 
odd integer. Since 1 is odd and 1 -1- 1 is even, the odds are not closed under addition, The even integers are closed under both addition and 
multiplication. If a and b are even, a = 2j and b = 2k for some j, k e Z., a + b = 2j + 2k = 2(j + k), which is even, and 
ab = (2 j)(2k) = 2(2 7 ^), which is also even. 

7. That GL (2, R) is a group follows from laws of matrix algebra. In addition to being associative, matrix multiplication on two-by-two 
matrices has an identity /, and if A e GL (2, R), it has an inverse by the definition of GL (2, R). The inverse of A is in GL(2,R) since it has an 
inverse: (A"')"' = A. 
9. If a, ft, c e R, 

(a * b) * c = (a + b + 5) * c 
=a+b+5+c+5 

= a + h + c + \0 

a * (b * c)is. also equal to a -i- -1- c -1- 10, and so * is associative. To find the identity we solve a * e = a for e: 

a * e = a =>a + e + 5= a =>e = — 5. 
If a is a real number, the inverse of a is determined by solving the equation a * x = -5; 

a*x = -5 =^a + x + 5 = -5 =^ x = -a - 10 
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Since a is real, -a - 10 is real, and so * has the inverse property. 

11. By Supplementary Exercise 2 of this chapter, the identity for * is 2 and * is associative. All that is left to show is that * has the inverse 

4 I 4 

property. If a eQ* ,atx = 2^x = -; hence a" = -, which is also a positive rational number. 
13. Recall that matrix multiplication is the operation on GL(2, IR). 

AXB = C =^ XB = A-^C (multiply on the left by A"') 
Z = A-'CB"VmultiplyontherightbyB"') 



(- 1 




( - --) 






2 


2 2 








{ID 








V 3 J 








^ 6 3/ 


(b) 4 


(c)0 


(d)3 





17. (a) <1> = {1}, <3>= {1, 3}, <5> = {1, 5},and<7> = {1, 7}. 

(b) No, because no cyclic subgroup equals U(Z.g). 

19. (a) A, B e SL(2, IR) => |A| = |B| = 1. 

|AB| = |A|-|B|= 1-1 = 1 AB eSL(2, R) 

SL(2, R) is closed with respect to matrix multiplication 

(b) |/| = 1 / e SL(2, R) 

(c) A eSL(2, R) |A| = 1 

|A-'| = |A|-' = 1 ^ A-' e SL(2, R) 

21. Yes, S is a submonoid of B3X3. The zero matrix is in S since it is the matrix of the empty relation, which is symmetric. Furthermore, if A 
and B are matrices of symmetric relations, 

(A + B)ij = Ajj + Bjj definition of matrix addition 

= Aji + Bjj since both A and 6 are symmetirc 

= (A + B)ji definition of matrix addition 

Therefore, A + B is symmetric, which means that it is the matrix of a symmetric relation and that relation is in 5. 

23. (a) (1,4, 20) (b) (-1, 0, -1, -1) (c)(l/3,4) (d)(-2, -3, -5) 

25. The groups in parts a and c are abelian, since each factor is abeUan. The group in part b is non-abeUan, since one of its factors, GL (2, R), is 
non-abelian. 

27, Since (4)= {0, 4, 8, 12) is a cyclic group and has order four, it must be isomorphic to Z4, 

29, (a) There exists a "dictionary" that allows us to translate between the two systems in such a way that any true fact in one is translated to a 

true fact in the other. 

(b) If one system is familiar to you, the other one should be familiar too. 

(c) If (p A g) <sf 0, and (p /\ q) ^ 0, then p <ts> 0. 

31 . The key to this exercise is to identify the fact that adding two complex numbers entails adding two pairs of numbers, the real and imaginary 
parts. If we simply rename these parts the first and second parts, then we are doing R^ addition. This suggests the function T : C — > R^ where 
T(a + bi) = (a, fo). For any two complex numbers a + ftiandc + di, 

T((a + bi) + (c + di)) = T((a + c) + (b + d)i) definition of + inC 
= [a + c, b + d) definition of r 
= {a, b) + {c, d) definition of + inR^ 
= T(a + bi) + T{c + di) definition of T 

Since T has an inverse {T~^{a, b) = a + bi),T\sdxi isomorphism and so the two groups are isomorphic. 

It should be noted that T is not the only isomorphism between these two groups. For example U{a + bi) = (b, a) defines an isomorphism. 

33. The key here is to realize that both groups consist of elements that are constructed from four real numbers and that you operate on elements 
by adding four different pairs of real numbers. An isomorphism from R"* into M2x2(IR) is 

T(a, b,c,d) = ^ j 

There are an infinite number of isomorphism in this case. This one is the most obvious. 
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CHAPTER 12 
Section 12.1 

l.(a){(4/3, 1/3)} 

(b) {(-3 - 0.5;«:3, U - 4x3, X3) \ X3} 

(c) {(-5, 14/5, 8/5)} 

(d) {(6.25 - 2.5x3, -0.75 + 0.5;i:3, X3) \ X3 
3. (a) {(1.2,2.6,4.5)} 

(b) {(-6^:3 + 5, 2;c3 + 1, ;<:3) I ;i:3 e R} 

(c) {(-9;«:3 + 3, 4, ;c3 ) I ;«:3 e R} 

(d) [(3x4 + 1, -2x4 + 2, X4 + 1, X4) I Jr4 e R} 
5. (a) {(3,0)} 



(b) 



R} 



1 


1 


2 


1 ' 




(' 1 


1 


2 


1 




1 


2 


4 


4 


-R1+R2 





1 


2 


3 




1 


3 


3 


, 


-Ri + R3 


.0 


2 


1 


-1 












-Ri + Ri 


( 1 








-2 ^ 










— > 





1 


2 


3 












-2R2 + R3 


.0 





-3 


-7 J 












1 








-2 > 












-> 





1 


2 


3 












--R3 








1 


7 

3 , 





Y i?3 + /?2 - 

The row reduction can be done with Mathematica: 
/ 1 1 2 1 



( \ -2 ^ 
12 3 

1 7 

3 J 



RowReduce 



12 4 4 

13 3 



-2 A 



_5 

3 

7 
3 



el 
1 
1 

In any case, the solution set is {(-2, -5/3, 7/3)} 
7. Proof: Since is the n x 1 matrix of O's, let's call it 0. Let S be the set of solutions to A Z = 0. If Zi and X2 be in S. Then 

A(X, + X2) = AX, + AX2 = + = 
so X\ + X2 & S; that is, S is closed under addition. 
The identity of R" is 0, which is in S. Finally, let X be in S. Then 

A(-X) = -(AX) = -0 = 0, 
and so -X is also in S. 

Section 12.2 



(a) 



/ 15 


30 \ 


11 


11 


3 


5 


, 11 


11 / 
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-20 


21 


9 


-- 1 


2 


2 


2 


2 


-1 








-4 


2 


1 





7 


7 




1 




2 


2 


2 / 



(b) 



(c) The inverse does not exist. When the augmented matrix is row-reduced (see below), the last row of the first half caimot be manipulated to 
match the identity matrix. 

1 0^ 

(d) -3 1 1 

. -4 1 2 , 

(e) The inverse does not exist. 



(f) 



' 9 -36 30 \ 
-36 192 -180 
, 30 -180 180 , 



5. The solutions are in the solution section of Section 12.1, exercise 1, We illustrate with the outiine of the solution to Exercise 1(c) of Section 
12.1. 



/ 1 


1 


2 




'Xi ' 




( 1 








1 


2 


-1 




X2 




-1 








, 1 


3 


1 




.X3 , 




, 5 












( 1 


1 


2 






5 


5 


-5 ^ 






1 


2 


-1 


1 

~ 5 


-2 


-1 


3 






.1 


3 


1 




1 


-2 


1 , 



' xi • 




f ^ 1 




14 


X2 




-1 




5 






. 5 , 




8 

. 5 . 



and 



Section 12.3 

3. (b) Yes 

7 . If the matrices are named B , A i , A2 , A3 , and A4 , then 



A 5 . -5 . 23 . 

- Ax + - A2 + yAs + Y A4. 



9.(a)lfxi = (1, 0),j<:2= (0, l),andy = {bx, ^)2),then 
y = hxxx+ h2X2. 
If X, = (3, 2),X2= (2, l),andj = (bi, M.then 
y =(-bi +2b2)xi +(2bi -3b2)x2. 
The second linear combination can be computed using Mathematica as follows. 
Solve[ci {3, 2} +C2 {2, 1} == {bi, bz}, {ci, C2}] 
{{ci^2b2-buC2^2bi-3b2}} 

(b) If y = {b\, 62) is any vector in , then 

y ={-'ibi+^b2)xi+{-bi+b2)x2 + {(i)Xi 

(c) One solution is to add any vector(s) to xi,X2, and x^ of part b. 

(d) 2,n 

(e) If the matrices are Ai, A2 , A3, andA4 , then 

^ I = xAi z + VA2 + ZA3 + WA4 
\z w I 

ao + aix + a2 x^ + a^x^ = ao(l) + ai{x) + a2(x^) + a3(x^) . 
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1 1 . (a) The set is linearly independent: let a and b be scalars such that a (4, 1) + i(l , 3) = (0, 0), then 



4a + h = and 
a + 3b= 

which has a = = as its only solutions. The set generates all of : let (a, b) be an arbitrary vector in . We want to show that we can 
always find scalars ySi and P2 such that jSi(4, 1) + y32(l. 3) = (a, b). This is equivalent to finding scalars such that 4y8i +yS2 = a and 



+ 3 /32 = h. This system has a unique solution /3i 



3a-b 



and ^2 



4b- 



-. Therefore, the set generates R^. 



11 ' ' 11 

13. (d) They are isomorphic. Once you have completed part (a) of this exercise, the following translation rules will give you the answer to parts 
(b) and (c), 

a b 



{a, b, c, d) ^ ( " ,\<-> a + bx + c£- + 
\c d I 

Section 12.4 



dx^ 



1 . (a) Any nonzero multiple of 



is an eigenvector associated with A = 1 . 



(b) Any nonzero multiple of ^ 2 j ^ eigenvector associated with A = 4. 



(c) Let xi = ^ and '"^2 = ^2^^) • verify that 01X1 + 02X2 = ^^! if and only if ci = C2 = 0. Therefore, {xi , X2} is linearly 

independent. 



3. (c) You should obtain 












lo 


1 







, depending on how you order the eigenvalues. 



5.(a)IfP = (^ _\ ). then P-1AP = ( J _° ). 

(b) IfP = (^ ;),thenP-'AP = (^ ° ). 

(c) KP = (-),thenP-..P = (-). 



(d) If P = 



r 1 -1 M 
-14 2 
,-1 1 1. 



thenP ' AP = 



f-2 ^ 

1 
,0 0, 



(e) A is not diagonaUzable. Five is a double root of the characteristic equation, but has an eigenspace with dimension only 1 . 



(f) If P = 



' 1 1 M 
-2 1 
, 1 -11. 



3 ^ 

then P-'AP = I 1 



,000; 

7. (b) This is a direct appUcation of the definition of matrix multipUcation. Let Ap) stand for the row of A, and let P^^ stand for the column 
of P. Hence the 7* column of the product A P is 

fAdjPW^ 
A,2) P^J^ 

Hence, (AP)*^^ = A(P*^^) for y = 1 , 2, . . . , n. Thus, each column of A P depends on A and the y* column of P. 

Section 12.5 

3. If we introduce the superfluous equation 1 = 0-Si^_i + 1 we have the system 

Sk =5S*.i+ 4 
1 = 0-Si_i + 1 
which, in matrix form, is: 
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Let A=^ ^ j . We want to diagonalize A; that is, find a matrix P such that P ' A P = D, where D is a diagonal matrix, or 
A =PDP-^ => A* =PD^P 



Diagonalizing A: 
|A-c/| = 



= (5-c)(l-c) 



5 -c 4 
1 -c 

The eigenvalues are c = landc = 5. If c = 1, 



(o ojlxz) (o) 

which implies j;i + = 0, or = -■^2. and so ^ j is an eigenvector associated with 1 . 
Ifc = 5, 

Therefore, ^ ^ j is an eigenvector associated with 5. Combining the two eigenvectors, we get 



and 



A* = 



1 I \ n Y fO -1 

-1 jU 5 j U 1 
" (-1 o)(o 50{l 1 ) 

Hence, (\'),(^^ =';')(») =(='-') ^-M^.S. = 5' - L 



( I I \ 



5. Since A = A' 



1 1 

lo 1 1 ; 

path of length 1 for every other pair of nodes, 
(b) The characteristic polynomial is 



t here are paths of length 1 from: node c to node a, node b to node h, and node a to node c; and there is 1 



\A-cI\ 



1 -c 1 
1 -c 1 
1 1 -c 



Solving the characteristic equation -c^ + 2c^ + c- 2 = 0we find solutions 1,2, and -1 . 
If c = 1 , we find the associated eigenvector by finding a nonzero solution to 



/O 1 ^ 




' -^1 ~ 




(0 \ 


1 -1 1 











.0 1 0, 




.X3 , 
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One of these, which will be the first column of P, is 



( 1 \ 


-1 . 





f-l 1 ^ 


'X, ■ 




(0 ■\ 




f ' 1 




If c = 2, the system 


1 -2 1 









yields eigenvectors, including 


1 


, which will be the second column of P 




, 1 -1 , 


.X3 , 




,0 . 




, 1 . 





If c = -1, then the system determining the eigenvectors is 



('2 1 > 








(0 \ 


1 1 1 











.0 1 2, 




.^3 , 




.0 , 



and we can select 



( ^ ) 

-2 

. 1 . 



although any nonzero multiple of this vector could be the third column of P. 











( 1 


1 


1 \ 


(c) Assembling the results of (b) we have P = 





1 


-2 










,-1 


1 


1 , 




■ 1-* ■ 




( 1 





^ 




A*= P 


2* 


p-i = p 





16 









. (-1)"* , 




,0 





1 . 





( \ 16 \ \ 
16 -2 
-1 16 1 , 



(6 5 5 \ 
5 6 5 
5 5 6, 



3 
\_ 

\ 6 



-i^ 

2 

\_ 1_ 
3 3 

_1_ 1_ 

3 b I 



Hence there are five different paths of length 4 between distinct vertices, and six different paths that start and end at the same vertex. The 
reader can verify these facts from Figure 12.4.1. 

^ »2. 
and e^*'^ - 



7. (a) 



' e e 
,0 



lo 



f e — e 




(b) Let be the zero matrix, 4-" = / + 0+ — + — + ...=/. 

(c) Assume that A and B commute. We will examine the first few terms in the product e'^ . The pattern that is established does continue in 
general. In what follows, it is important that AB = B A. For example, in the last step, (A + BY expands to A^ +AB + BA + B^, not 
A^ + 2AB + B^, if we can't assume commutativity. 



/ + A^ h — + / + B + — + — + 

\ 2 6 / \ 2 6 

I + A + B + — + AB + — + — + — + 

2 2 6 2 



+ — + • 

2 6 



= / + (A+B) + -(A'+ 2AB + B^)+ - {A' + 3A^B+ 3AB-+ B') - 

2 6 



■ I + (A + B)- 



(A + Bf + -(A + Bf + ■ 



[A + Bf iwlAB + A' + B'. 

(d) Since A and -A commute, we can apply part d; 
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= / by part b of this problem. 

Supplementary Exercises— Chapter 12 

1. (a) xi = X2 = x-} = 1 

(b) Xi = ^,X2= 0,X3 = ^ 

-8-4 1 ^ 
7 3-1 
1-5-2 I , 

5. Suppose that A~* exists and that ai(Axi) + a2(,Ax2) is equal to the zero vector, 0. By applying several laws of matrix algebra, this implies 
that 

A{ai xi + a2 X2) = => a\ x\ + 02X2 = ^ since A"' exists 

=> tti =^2 =0 since {xi, X2) is a basis 
=> |Axi is linearly independent 

To see that {Axi, AX2) also spans , let e IR^, we note that since [xi, X2] is a basis, it will span A"' b: 

ai xi +02X2= A"' b for some a\,a2 & R. 

Using laws of matrix algebra: 

ai{Ax]) + a2{Ax2) = A(ai xi+a2 X2) 
= A(A-' b) 



Hence, & is a linear combination of A xi and A jr2- 

If A has no inverse, then Ax = has a nonzero solution y, which is spaimed by the vectors xi and X2: y = UiXi + 02X2 , where not both of the 
a's are zero. 

Ay = A(ai xi + 0-2 X2) = 

=> Q'i(A JCi) + 0-2 (A JC2) = 

=>{Axi,Ax2) is linearly dependent 

7. (b) -X = X 



(c) 2^ = 64, since each entry can take on two possible values. 
9. A = P-^DP Ai<» = p-'Dioop 

/ 0.6 0.2 _ 1^ / 1 2 Ul'"" V 1 1 
I0.4 0.8 j" 3(2 -1 ){ 0.4'" JU -1 

Note: 0.4'«' = 1. 60694 x lO"''*' «0 . 
11. (a) A = 0, ±^^2 



r i i ^ 

3 3 

2 2 

V 3 3 



(b) B = PDP-^ 



1 ^ 

1 1 
10 -2 , 



M ^ 

4 
2, 



/ 1 
1 i 







1^ 

2 / 



13. (a) Let the vertices be a 1, 02, and ay, and use the convenient matrix representation 

G] CI2 ^3 



ai 
a2 
03 



f2 1 
1 3 
.1 1 0) 



one sees immediately, for example, that there are 3 different edges from 02 to 03, so that the multigraph is 



Applied Discrete Structures by A. Doerr & K. Levasseur is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States License. 



Solutions to Odd Numbered Exercises 



469 




(5 2 3 
(b) 5 4 

. 3 1 3 J 

the graph that there are 3 different paths of length 2 from ai to 03 



and by Theorem 12.5.1, (A^); j is the number of paths of length 2 from a; to a,-. For example, the reader can verify from 



CHAPTER 13 
Section 13.1 

l.(a)l,5 (b)5 
(c) 30 (d) 30 

(e) See Figure 13.4.1 with 0= l,ai =2,02 = 3,03 =5, hi =6,h2 = 10, = 15, and 1 : 
3. Solution for Hasse diagram (b): 
(a) 



30 



lub 


ai 


02 


03 


04 


"5 


gib 




02 


03 


04 


QS 


ai 


ai 


"2 


03 


04 


05 


ai 


ai 










02 


02 


"2 


04 


04 


as 


«2 


ai 


a2 


Ol 


"2 


"2 


03 


03 


04 


03 


04 


05 


a 3 


ai 


a\ 


03 


a3 


a3 


04 


04 


04 


a^ 


04 




a4 


Ol 


02 


03 


04 


a^ 


05 


"5 




"5 


as 


as 


Os 




02 


03 


04 


Qs 



(b) oi is the least element and as is the greatest element. 
Partial solution for Hasse diagram (f): 

(a) lub(a2, a3) and lub( 04, as) do not exist. 

(b) No greatest element exists, but ai is the least element. 
5 . If and ' are distinct least elements, then 



< ' since is a least element 
' < since ' is a least element 

Section 13.2 



= ' by antisymmetry, a contradiction. I 



1. Assume to the contrary that a and b have two different greatest lower bounds, and call them g and h. Then g > h since g is a greatest lower 
bound and h>g since is a greatest lower bound. Therefore, by antisymmetry h = g. 

3. (a) See Table 13.3.1 for the statements of these laws. Most of the proofs follow from the definition of gcd and 1cm. 
(b) (partial) We prove two laws as examples. 

Commutative law of join: Let [L, V , A] be a lattice, a, b e L. We must prove that a \J b = b \J a. 

Proof: By the definition of least upper bound, a \J b > b and a\J b >a therefore, by Exercise 4, part c, a \/ b > b \/ a. Similarly, 
b y a > a y b, and by antisymmetry a \J b = b \J a. ■ 

Idempotent law (for join): We must prove that for all a e L,a V a = a. 
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Proof: By the reflexive property of <,a < a and hence, by 4(c), a < a V a. But a is an upper bound for a; hence a > a V a. By antisymmetry, 
a = a y a. ■ 

Section 13.3 

1. 

B Complement of B 






A 


la] 


{b,c} 


[b] 


[a, c) 


{c] 


{a, b} 


{a, b] 


{c] 


[a, c] 


ib) 


{b, c) 


la] 


A 






This lattice is a Boolean algebra since it is a distributive complemented lattice. 
3. aandg. 

5.(a)S*:a V b= arSa > b 

(b) S:AnB = AifAcB 
5"':AljB = AifADB 

(c) Yes 

(d) S-.pAq'^pifp^q 
S* : p \/ q ^ pifq ^ p 

(e) Yes 

7. Definition: Boolean Algebra Isomorphism. [B, A, V > -] is isomorphic to [B', A> V> -]if and only if there exists a function 
T:B^ B' such that 

(a) ris abijection; 

(b) Tia Ab) = T{a) A T{b) for all a, 6 e B 

(c) T{a yb) = T(a) V nb) for all a, i e B 

(d) T(S) = T{a) for all a eB. 
Section 13.4 

1 . (a) For a = 3 we must show that for each x e D30 one of the following is true: xA3 = 3orxA3 = l. We do this through the following 
table: 

X verification 
~l 1 A3 = l 

2 2A3=1 

3 3A3=3 

5 5A3=1 

6 6A3=3 
10 20 A 3 = 1 
15 15 A3 = 3 
3° 30 A3 = 3 

For a = 5, a similar verification can be performed. 

(b)6 = 2 V 3, 10 = 2 V 5, 15 = 3 V 5,and30 = 2 V 3 V 5. 

3.IfB = D30 30 then A = {2, 3, 5} and D30 is isomorphic to f(A), where 
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1 <^0 5^ (5) 

2 «-» (2) 10 {2, 5) 

3 <^ {3} 15 <-» {3, 5) 

6^ (2,3) 30 <^ {2,3,5 



Join «-» Union 

and Meet Intersection 

Complement «-> Set Complement 



5. Assume that or 1 is the third element of a Boolean algebra. Then there is only one possible set of tables for join and meet, all follow- 
ing from required properties of the Boolean algebra. 



V 





X 


1 


A 





X 


1 








X 


1 














X 


X 


X 


1 


X 





X 


X 


1 


1 


1 


1 


1 





X 


1 



Next, to find the complement of x we want y such that x /\ y = and x\l y = 1. No element satisfies both conditions; hence the lattice is not 
complemented and cannot be a Boolean algebra. The lack of a complement can also be seen from the ordering diagram from which A and V 

must be derived. 

7. Let X be any countably infinite set, such as the integers. A subset of X is cofinite if it is finite or its complement is finite. The set of all 
cofinite subsets of X is: 



.}. For each finite subset A of Z, map 



(a) Countably infinite - this might not be obvious, but here is a hint. Assume X = [xq, xi, xi, 
that set to the integer 

!=0 

You can do a similar thing to sets that have a finite complement, but map them to negative integers. Only one minor adjustment needs to 
be made to accommodate both the empty set and X. 

(b) Closed imder union 

(c) Closed imder intersection, and 

(d) Closed imder complementation. 

Therefore, if B = {A c Z : A is cofinite), then B is a coimtable Boolean algebra imder the usual set operations. 
Section 13.5 



l.(a) 



V 



(0, 0) 
(0, 1) 

(1,0) 

(1, 1) 



(0,0) (0,1) 



(0,0) (0,1) 

(0,1) (0,1) 

(1.0) (1,1) 

(1.1) (1,1) 



1,0) (1,1) 



A 



1.0) (1,1) 

1.1) (1,1) 

1.0) (1,1) 

1.1) (1,1) 



(0,0) 
(0, 1) 

(1,0) 

(1, 1) 



(0,0) (0,1) (1,0) (1,1) 



(0,0) (0,0) (0,0) (0,0) 

(0,0) (0,1) (0,0) (0,1) 

(0,0) (0,0) (1,0) (1,0) 

(0,0) (0,1) (10) (1,1) 



u 


u 


(0, 0) 


(1, 1) 


(0, 1) 


(1,0) 


(1,0) 


(0, 1) 


(1, 1) 


(0, 0) 



(b) The graphs are isomorphic. 

(c) (0,1) and (1,0) 

3.(a)(l, 0, 0, 0),(0, 1, 0, 0),(0, 0, 1, 0),and(0, 0, 0, 1) are the atoms, 
(b) The n-tuples of O's and 1 's with exactly one 1 . 
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Section 13.6 

1(a) 



Mi(xi, X2) 


= 




M2{Xu X2) 


= (xTAxB 






= (mAx2) 




Mnixi , X2) 


= (xi A xi) 






= (^1 A X2) 




Mf,{xx, X2) 


= ((xrAx3 V(^Ax2)) 


= xr 


Mi(xx , X2) 


= ((xTAxJ) V (xi Axi)) 


= x^ 


Mg(xi, X2) 


= ((xrAxi)VUi Ax2)) 


= ((xi Ax2)V(^Axi)) 


MgiXi, X2) 


= ((xrAx2) Vfe AxD) 


= ((xi AxDV(^Ax2)) 



Mio(xi , X2) = ((xT A X2) V (xi A X2)) = X2 
Mii(xi , X2) = ((xi A x3 V (^1 A X2)) = xi 
Mi2(xi, X2) = ((xT A ^ V (xTA X2) V (^1 A xD) = (xTV xi) 
Mi3(xi, X2) = ((xTAxJ) V (^Ax2) V (xi AX2)) = (xrVx2) 
Mi4(xi , X2) = ((xT A xj) V (^1 A x3 V (^1 A X2)) = (xi V x3 
Mi5(xi , X2) = ((xT A X2) V (-^i A xj) V (.Xi A X2)) = (xi V X2) 
Mieixu X2) = ((xT A xj) V (^A X2) V (J^i A x3 V (J^i A X2)) = 1 
(b) The truth table for the functions in part (a) are 





X2 


Ml 


M2 


Mj 


M4 


M5 


Mi 


M^ 


M, 


M, 


Mio 


Ml 1 M12 


M13 


Mm 


M15 


Mis 











1 











1 


1 


1 








1 


1 


1 





1 





1 








1 








1 








1 


1 


1 


1 





1 


1 


1 














1 








1 





1 





1 1 





1 


1 


1 


1 


1 














1 








1 





1 


1 


1 


1 


1 


1 



(C) /i(Xi, X2) =Mi5(Xi, X2) 
/2(Xi, X2) =Mi2(Xi, X2) 

Mxi, X2) =Mi(xi, X2) 

Mxi, X2) =Mi6(xi, X2) 

3. (a) The number of elements in the domain of /is 16 = 4^ = |Bp 

(b) With two variables, there are 4' = 256 different Boolean functions. With three variables, there are 4* = 65 536 different Boolean functions. 

(c) /(xi,x2) = (i A^A^Va A^A^2)V(i A^i A:^V(OA^i A^2) 

(d) Consider f.B^^B, defined by /(O, 0) = 0, /(O, 1) = 1, /(I, 0) = a, /(I, 1) = a, and /(O, a) = fo, with the images of all other pairs in 
defined arbitrarily. This function is not a Boolean function. If we assume that it is Boolean function then /can be computed with a Boolean 
expression M(xi , X2). This expression can be put into minterm normal form; 

M(xi , X2) = (ci A xf A xj) V (c2 A xT A X2) V (C3 A xi A xj) V (C4 A xi A X2) 

/(O, 0) = ^ M(0, 0) = ^ ci = 
/(O, 1) = 1 M(0, 0) = 1 => ci = 1 
/(l,0) = a =* M(0, 0) = a ci = a 
/(I, l) = a => M(0, 0) = a ^ ci = a 

Therefore, 

M(xi, X2) = (xfA X2) V (a A xi A X2) V (a A xi A X2) 

M(0, a) = (OA a) V (a A A a) V (a A A a) = a 
This contradicts /(O, a) = b, and so/is not a Boolean function. 
Section 13,7 
l.(a) 
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(b) f{Xi,X2,X3)= {{Xi +X2)-X3)-{Xi + Xj) 

= ((Xl +X2)+X^)-iXi +X2) 

= {Xl + X2) ■ (Xi + X2) + X3 ■ (Xi + Xt) 

= + xi- {x\ + X2) 

— Xj • (x^ + X2) 

(c) The Venn diagram for the function is: 




We can read off the minterm normal form from this diagram: 

f(Xiy X2 , X3) — X\ • X2 ' + • X2 ' X3 + X\ • X2 ' x^ 

(d) 

Simplified form: 
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*3- 



Mmcain rorm: 



A'l- 



> — f 








■r? 






■X:, / 














I 











I 


Q 


1 








1 


[] 




1 


1 





1 


1 




D 


Q 


1 










1 


1 


I 





1 










1 


1 







1 


1 


1 


I 


1 











Current will flow only when one of the switches xi or X2 is On and X3 is Off. 



(b) f(xi,X2, X3) = (((Xi •X2) +X3)-(X2 + X3))+.\:3 

placing ( ) ' s to indicate order of evaluation 

= (((^1 ■X2)-(X2)) + Xj) + X3 

by the distributive law of + over • 

= (Xi-(X2-X2)) + (X3 +X3) 

by the associative laws of • and + 

= (xi ■X2) + X3 

by the idempotent laws of • and + 



(c) 
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^2 





Supplementary Exercises— Chapter 13 

1 . (a) The following Sage input generates an ordering diagram. 
Poset({l:[2,3,5,7,ll],2:[4,6,10],3:[6,9],4:[6,8,12],5:[10],6:[12]}).plot() 




© © © 




(b) The ordering diagram for < is a chain 
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12 

/ 

11 

/ 

/ 

9 

/ 

8 

/ 

7 

/ 

6 

/ 

/ 

4 

/ 

3 

/ 

/ 

1 

3. (a)4 V 8 = 8,3 V 15 = 15,4 A 8 = 4,3 A 15 = 3,3 A 5 - 15. 
(b) Yes. Let a, h, c and assume that there are n primes, pi, P2, Pn that appear as factors ofa,b and c. Then we can write 



a = 


P\" Pi^ ■ 


■Pn'° 


b = 


P\" Pi'^ ■ 


■ P/" 


c = 


Pi'" P2*^ ■ 


Pn 



where each exponent is a nonnegative integer. The greatest common divisor and least common multiple of two integers such as a and b can be 
expressed in terms of these exponents. 

a t\b = god (a, h) = pi'"^ pz""" •■■ pn"" 
where = min(!V, jr) and 

aM b = lcm(a, b) = pi'^' pi'^^ ■■■ pj^" 
where Mr = maxOV, jr)- 

Based on this observation, we can compare a f\ (b \l c) and (a /\b) V (a Ac). The exponent of p, is minOV , max(7V, )) in a A V c) 
and max(min(i r , jr), min(iV , kr)) in (a A b) \/ (a A c). These two exponents are equal; this is easiest to verify by checking the possible 
relative sizes of iV, jr and kr- Therefore, the lattice is distributive. 

(c) The least element is 1 . There is no greatest element. 

5. (a) The ordering diagram is the one-cube in Figure 9.4.5. It is interesting to note that the poset relation is really the logical impUcation, =>, 
since 0, => 1, 1 => 1 are all true statements. 

(b) From the definitions of lub and gib and part (a) we have the tables 



A 





1 


V 





1 

















1 


1 





1 


1 


1 


1 



which are the logical tables for the connectives "and" and "or." 

(c) = L X L = {(0, 0), (0, 1), (1, 0), (1, 1)) where the poset relation < on and the binary operations A and V are all defined compo- 
nentwise so that, for example, (0, 1) < (1, 1), since in the two first coordinates, < 1 and in the two second coordinates, 1 < 1. Also, for 
example, (0, 1) A (1. 0) = (0 A 1. 1 A 0) = (0, 0). The operation tables are given in the solution of Exercise 1 Section 13.5. The Hasse 
diagram for L? is the two-cube. 

(d) The Hasse diagram for L? is the three-cube. Tables for A and V can 
easily be constructed where, for example, 

(1, 0, 0) V (0, 1, 0) = (1 V 0,0V 1,0V0) = (1, 1,0) 
7. (a) No. It is not true that every pair of elements in A has both a lub and a gib 
in A. For example, 10 V 4 does not exist in A. 
(b) Yes. For all a, b <= A,a +b. 
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a \J b = the maximum of a and 6, 

a /\ b = the minimum of a and 

9. (x + y) ■ (x + y) = X + (y -y) by the distributive law of + over • 
= X + by the complement law 
= X by the identity law 

The switching circuit diagram has a single switch labeled x. 
11. (a) 



X 


complement(s) of x 





1 


ai 


02, a^, an, flg 


ai 


fli, as 


03 


a\ , 05 


04 


a\ , 05 


«5 


a2, (Js, 04, 06 




fli, as 


1 






(b) No, it is not distributive, for if it were, complements would be unique. 

13.(a)D20 = |1, 2, 4, 5, 10, 20) contains 6 elements and so cannot be a Boolean algebra by Corollary 13.4.1. 

(b) £)27 = (1, 3, 9, 27) has four elements and so we cannot use Corollary 13.4.1 to rule it out as a Boolean algebra. However, 3 has no 
complement, which means that Dxi is not a Boolean algebra. 

(c) D35 = |1, 5, 7, 35) has 4 = 2- elements, and so that it may be a Boolean algebra by Corollary 13.4.1. We can confirm through the 
definition of a Boolean algebra that it is. 

(d) Notice that 210 = 2-3 -5 -7, which means that |Z)2iol = 16 = 2"* and so Corollary 13.4.1 can't be used to rule it out as a Boolean algebra. 
Indeed, D210 is a Boolean algebra, which can be confirmed by applying the definition of a Boolean algebra. 

15. (a) First, by definition of subsystem in Section 11.5, a sub-Boolean algebra of a Boolean algebra B is a subset W oi B which is a Boolean 
algebra under the same operations as B. Specifically, W must satisfy the conditions: 

(i) The and 1 of B must be in W, 

(ii) a e W =) a e W 

(iii) a,b(iW ^a\J b(iW&Ma Kb&W. 

Hence if W is to contain 4 elements it must be of the form {O, /5, ~j3, l). W\ = {(0, 0, 0), (0, 1, 1), (1, 0, 0), (1,1, 1)) is one such 
set. The 3-cube below illustrates this sub-Boolean algebra. 




There are two others that are isomorphic to this one, where Corollary 13.4.2, assures us of this isomorphism. 

(b) Again, the form of the sub-Boolean algebra with four elements must be {O, l). Since the 2" elements of B?' can be paired up with 

their complements to give us 2""' pairs, there are 2""' - 1 ways to select the elements j8 and j3 (0 and its complement, 1, are already selected). 
Of course, all of these sub-Boolean algebras are isomorphic. 
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(c) A sub-Boolean algebra with 2* elements must have k atoms; so the selection of k elements that will act as atoms can be considered in 
counting numbers of sub-Boolean algebras of a certain size. What is the number? We leave it to the reader in the general case. 

17. (xf A X2 A X3) V A jc^ A .\:3) V (^1 A X2 A X3) 

19. (a) Since each of the three variables can be any one of two values there are 2^ rows, (See Table 13.6.3 for an example.) For n variables 
there are 2" rows. 

(b) For each row, there can be any one of two truth values. Since there are 2' = 8 rows there are 2^ = 256 functions. For n variables and 
m = 2" rows, there are 2"' = 2^" functions. 




, [ 




(c) /(Xi, X2, X3) = ((Xi + X2 + X3)-Xf 4- Xj + Xx)-Xx -X^ 

= (X] -Xf -I- X2 -Xf -I- X3 -XT-I- Xi -I- X2) -Xi -Xt, 
= (0 -I- X2 -XT-I- X3 - xf -I- Xi -I- X2)-Xi -Xt, 
= (X2 -Xf-I- X3 - xf -I- Xi -I- X2)-Xi -X^ 

— X2 ' X\ • X] • X3 X3 • Xi • X\ • X3 -f- X^ ■ Xi • X3 -t- X2 ' Xi • X3 
= X2 -O-X^-l- X3 -O-Xjj^-I- Xi -X?)^ -I- X? -Xi -x^ 
= Xj • X3 -f- X2 ' Xi • X3 
= Xi-X]"-(l +x^ 

Switching and gate diagrams to be added. 
23 . (a) z = (xi + X2) + X2-X3 

(b) Z = (xT -I- X2) -I- X2-X3 

= (xT -I- X2) -I- (x^ -I- xj) 

= Xl+{X2+ xi) + X3 
= Xl + l+Xi 

= 1 

The circuit is always on, no gates are necessary. 

CHAPTER 14 
Section 14.1 

1. (a) Si is not a submonoid since the identity of [Zg , Xg], which is 1, is not in Si. S2 is a submonoid since 1 e ^2 and ^2 is closed under 
multiplication; that is, for all a, b e 52,aX8& is in ^2. 

(b) The identity of [^'^ is the identity function / : I'^ ^ ['il defined by / (a) = a, V a e If a e f^, / (a) = a < a, thus the identity of I'^^ is in 5i . 
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However, the image of 1 under any function in S2 is 2, and thus the identity of I^"' is not in S2, so S2 is not a submonoid. The composition of any 
two functions in Si , /and g, will be a function in Si : 

(/og)(n)= /(g(n)) <g(n) since /is in Si 
<n since g is in 5i 

Thus f og eSi, and the two conditions of a submonoid are satisfied and Si is a submonoid of F'sl^ . 
(c) The first set is a submonoid, but the second is not since the null set has a non-finite complement. 

3. The set oi n x n real matrices is a monoid under matrix multiplication. This follows from the laws of matrix algebra in Chapter 5. To prove 
that the set of stochastic matrices is a monoid over matrix multiplication, we need only show that the identity matrix is stochastic (this is 
obvious) and that the set of stochastic matrices is closed under matrix multiplication. Let A and Bben x n stochastic matrices. 

iAB)ij= I^aahj 
k=i 

The sum of the column is 

n n n n 

'Z(AB)ij= Y.aikhj+ Y.aikb^j+ — + Y.a„khj 

j=l k=l k=i k=l 

n 

= L{aikbkj + aikbkj + ■■■ +ankbkj} 

k=l 

n 

= 'LhMik +a\k+ ■■■ +ank) 

k=\ 
n 

- ^ bi.j since A is Stochastic 

k=l 

= 1 since B is Stochastic 

Section 14.2 

1. (a) For a character set of 350 symbols, the number of bits needed for each character is the smallest n such that 2" is greater than or equal to 

350. Since 2'' = 512 > 350 > 2** , 9 bits are needed, 

(b) 2'- = 4096 > 3500 > 2"; therefore, 12 bits are needed. 

3. This grammar defines the set of all strings over B for which each string is a palindrome (same string if read forward or backward). 
5. (a) Terminal symbols: The null string, 0, and 1 . 

Nonterminal symbols: S, E. 

Starting symbol: S. 

Production rules: 00 S, 5^015, S^IOS, S^llS, S ^ E, E^O, E^l 
This is a regular grammar. 

(b) Terminal symbols: The nuU string, 0, and 1. 

Nonterminal symbols: S, A, B, C 
Starting symbol: 5 

Production rules: S -» OA,S ^ 1A,S ->X,A -» OB,A IB, A X,B ^ OC,B IC,B -> A,C 0,C l,C \ 
This is a regular grammar. 

(c) See Exercise 3. This language is not regular. 

7. If .s is in A* and L is recursive, we can answer the question "Is s in by 

negating the answer to "Is s in L?" 

9 . (a) List the elements of each set x; in a sequence i , x; 2 . 3 . • • • • 
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c 



-Tji -^j; -t.-!3 J^M 

X\\^ X^2 X4} Jj4 



Then draw arrows as shown above and list the elements of the union in order established by this pattern: xn , X2\ , xi2, xi^, X22, x^i, x^i , x^j, 

Xl4t Xij, . . . 

(b) Each of the sets A' , A- , A-* , . . . are countable and A* is the union of these sets; hence A* is countable. 

Section 14.3 



X 


s 


Z(x, s) 


t(x, s) 


Deposit 25 f 


Locked 


Nothing 


Select 


Deposit 25 (Z 


Select 


Return 25 


Select 


Press S 


Locked 


Nothing 


Locked 


Press S 


Select 


Dispense S 


Locked 


Press P 


Locked 


Nothing 


Locked 


Press P 


Select 


Dispense P 


Locked 


Press B 


Locked 


Nothing 


Locked 


Press B 


Select 


Dispense B 


Locked 




3. (000, Oil, 101, 110, 1111 

5. (a) Input:10110,Output: 11011 => lOllOis in position 27 
Input: 00100, Output: 001 1 1 ^ 00100 is in position 7 
Input: 11111, Output: 10101 => 1 1 1 1 1 is in position 21 

C G„ \ 

(b) Let X = xi X2 ... Xn and recall that for « > 1 , G„+i = , ^ ,- , where G,/ is the reverse of G„. To prove that the Gray Code Decoder always 

V 1 <Jn / 

works, let p (ri) be the proposition "Starting in Copy state, x's output is the position of x in G„; and starting in Complement state, x's output is 
the position of x in G„''." That p(l) is true is easy to verify for both possible values of x, and 1 . Now assume that for some n> 1 , p (n) is true 
and consider x = xiX2 ...x„ . 

If xi =0, 
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X ' s output = followed by (x2 ■■■ x„ x„+ \Y s output starting in Copy 
= followed by . . . x„ x„+i) ' j position in G„ 
= j:' J position in G„+i 

If ;<:i = 1, 

x' s output = 1 followed by (xi ... x„ x„+\)' s output starting in Complement 
= 1 followed by (X2 ■■■ x„ Xn+i) ' s position in G/ 
= j:' J position in G„+i 

Section 14.4 



Input String 


a 


b 




c aa ab ac 


1 


(a, 1) 


(a 


2) (c 


3) {a, 1) {a, 2) (c, 3) 


2 


(a, 2) 


(a 


1) (c 


3) (a, 2) (a, 1) (c, 3) 


3 


(c,3) 


(c 


3) (c 


3) (c, 3) (c, 3) (c, 3) 


Input String 


ba 




bb 


be ca cb cc 


1 


(a. 


2) 


{a, 1) 


(c, 3) (c, 3) (c, 3) (c, 3) 


2 


(a. 


1) 


(a, 2) 


(c, 3) (c, 3) (c, 3) (c, 3) 


3 


(c, 


3) 


(c,3) 


(c, 3) (c, 3) (c, 3) (c, 3) 



We can see that Ta Ta = Taa = Ta, TaT^-- 

Ta n n 



Tat,= Tf,, etc. Therefore, we have the following monoid: 



Ta 

n n Ta T, 
Tc T, T, Tc 
Notice that Ta is the identity of this monoid. 



(b) 



Input String 


1 


2 11 12 


21 


22 








A 


C 


BAD 


D 


A 








B 


D 


ABC 


C 


B 








C 


A 


D C B 


B 


C 








D 


B 


C D A 


A 


D 








Input String 


111 112 121 


122 


211 


212 


221 


111 


A 


C 


B B 


C 


B 


C 


C 


B 


B 


D 


A A 


D 


A 


D 


D 


A 


C 


B 


C C 


B 


C 


B 


B 


C 


D 


B 


c c 


B 


C 


B 


B 


C 



We have the following monoid: 





Ti 


T2 


Tn 


Tn 


Ti 


Tn 


Tl2 


Ti 


T2 


Ti 


Tb 


Tn 


T2 


Ti 


Tn 


Tt 


T2 


Tn 


Tn 


Tn 


T2 


Ti 


Tn 


Tn 



Notice that Tn is the identity of this monoid. 

3. Yes, just consider the unit time delay machine of Figure 14.4.2. Its monoid is described by the table at the end of Section 14.4 where the 
row and ^;^ column are omitted. Next consider the machine in Figure 14.5.3. The monoid of this machine is: 



Tx 

To 
T, 
Tn) 
Tm 
Tw 
Tn 



'0 



Ti To, 



Tn 



Too Toi Tio Tn 

Too Toi Tio Tn 

Too To] T\o Tn 

Too Toi Tio T^i 

Too Toi Tio Tn 

Tio Too Toi Too Toi T^ Tn 

Tn Tio Tn Too Toi Tio Tn 



Tx To Ti 

To Too Toi 

T, Tio Tn 

Too Too To\ 

Toi Tio Tn 
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Hence both of these machines have the same monoid, however, their transition diagrams are nonisomorphic since the first has two vertices and 
the second has seven. 

Section 14.5 

1. (a) 




(b) 



11,01 



(l,01,(u^ 



||0,01,|0,^>h-H^01,|l, u 
10,01 




Supplementary Exercises— Chapter 14 

1. Let/, g, h e M,anda e B. 

((f*g)*.h)(a) = (f*g)(a) A h(a) 

= (f(a)Ag(a))Ah(a) 
= f(a) M gia) A h(a)) 
= f(a) A(g*h) (a) 
= (f *ig * h)) (a) 
Therefore (f * g) * h = f * (g * h) => * is associative. 

The identity for * is the function u e M where u{a) = 1 = the "one" of B. If a e B 

(f*u) (a) = f(a) A u(a) = f(a) A 1 = /(a) 
Therefore f * u - f. Similarly u * f = f. 

There are 2- = 4 functions in M for B = B 2 . These four functions are named in the text (see Figure 14.1.1). The table for * is 

z i t u 



Z I Z I 

z z t t 
Z u t u 
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3. {a, bb, bbb, bbbb, . . .] 

5. S = start symbol. Nonterminals = {5, Bq , Bi, B2) 

S->Bo Bo->aBo Bo^bB, 

B, ^aBi B, -^bB2 Bi^b 

B2 ^ aB2 B2 ^ a 

1. 





Reject 




9. (a) 



00,l(](|n^ Left 



-4e/4- 



Right ^^^^,11/0 



11/1 



Down 



oo,o>;»ni/o 



(b) The possible output sequences are 100, 010, 001, and 111. Note: Output for f = 3 is determined by the next state, ^(4)). If s(A) = i(3), 
output at J = 3 is 0, while if ^(4) + .s(3), output at J = 3 is 1 . 

11. 
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I CHAPTER 15 



Section 15.1 

1. The only other generator is -1 . 

3. If \G\ = m ,m>2, and G = (a), then a,a^, a™ ' , a"* = e are distinct elements of G. Furthermore, a~ 
generates a*: 

(0-1)""-*= (a»'-l)'"-*= + * 

Similarly, if G is infinite and G = (a), then a"' generates G. 

5. (a) No. Assume that q & Q generates Q. Then (q) = [nq : n &Z}. But this gives us at most integer multiples of q, not every element in Q. 

(b) No. Similar reasoning to part a. 

(c) Yes. 6 is a generator of 6 Z. 

(d) No. 

(e) Yes, (1,1, 1) is a generator of the group. 

7. Theorem 15.1.4 implies that a generates Z„ if and only if the greatest common divisor of n and a is 1 (i. e., n and a are relatively prime). 
Therefore the list of generators of Z„ are the integers in Z„ that are relatively prime to n. The generators of Z25 are all of the nonzero elements 
except 5, 10, 15, and 20. The generators of Zjjj are the odd integers in Z256 since 256 is 2**. Mathematica expression to generate these sets are 
Select [Range [0, 24], Functlon[a, 6CD[25, a] = 1]] 

{1, 2, 3, 4, 6, 7, 8, 9, 11, 12, 13, 14, 16, 17, 18, 19, 21, 22, 23, 24} 

Select [Range [0, 255], Functlon[a, 6CD[256, a] = 1]] 

{1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 
67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 
167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 
213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255) 

9. (a) e■.Z^^ -> Z7 x Zu 

21 ^ (0, 10) 

5 ^ (5, 5) 

7 ^ (0, 7) 

15 ^ (1,4) 

sum = 48 «- (6, 4) = sum 

The final sum, 48, is obtained by using the facts that e-\l , 0) = 22 and 0~'(O, 1) = 56 

e-\6, 4) = 6 x^^e-\l, 0) + 4 X77r'(0, 1) 
= 6X7722 +774x7756 
= 55 +77 70 
= 48 

(b) Using the same isomorphism: 

25 ^ (4,3) 

26 ^ (5,4) 
40 ^ (5,7) 

sum = (0, 3) 

r'(0, 3)= 3x77 0-1(0, 1) 
= 3x7756 
= 14 

The actual sum is 91. Our result is incorrect, since 91 is not in Z77. Notice that 91 and 14 differ by 77. Any error that we get using this 
technique will be a multiple of 77. 
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Section 15.2 

1 . Call the subsets A and B respectively. If we choose e A and 5 e B we get +io 5 = 5 e B. On the other hand, if we choose 3 e A and 
8 eB, we get 3 +108 = 1 e A. Therefore, the induced operation is not well defined on {A, B}. 

3. (a) The four distinct cosets in G /H are 
H = ((0, 0), (2, 0)) 
(1, 0)+H = {(1, 0), (3, 0)} 
(0, l)+H = {(0, 1), (2, 1)}, 
and(l, I) +H= {(1, 1), (3, 1)} 
None of these cosets generates G/H; therefore G/H is not cyclic. Hence G/H must be isomorphic to Z2XZ2 . 

(b) The factor group is isomorphic to [R; +]. Each coset of R is a line in the complex plane that is parallel to the x-axis: t : C/R -> R, where 
T([a + bi\a& R)) = bis. an isomorphism. 

(c) <8> = {0, 4, 8, 12, 16} ^ |Z2o/<8>| = 4. 

The four cosets are: 0, 1, 2, and 3. 1 generates all four cosets. The factor group is isomorphic to [Z4, +4] because 1 generates it. 
5. a e bH ^ a = b * h for some h 

^b~^*a = h forsome/i 

« *a e H 



Section 15.3 

(3421) (3421) 

/'1234\ ^ /'1234\ 
(4213) ® (3 1 4 2) 

. ^ M 2 3 4 \ 
I2 1 4 3) 

3. Yes and no, respectively 

5.^4 = {i, r, r\ r\ .f\ h, h, ./4I 

/ 1 2 3 4 \ 
Where ; is the identity function, '" = 12 3 4 j I' ^'^'^ 

. fi 2 3 4 ^ n 2 3 4 ^ 

■'^ U 3 2 1 j •'^ V2 1 4 3 j 

=(3 2 1 4) "(1 4 3 2) 
The operation table for the group is 






i 


r 


P- 


r3 


h 


fl 


f3 


f* 


i 


i 


r 






/i 


fl 


h 


/4 


r 


r 






i 


U 


/3 


fl 


fl 






? 


i 


r 


h 


/l 


/4 


/3 




? 


i 


r 




h 


/4 


fl 


/l 


/i 


h 


h 


fl 


u 


i 




□ 


^3 


/2 


h 


h 


h 


h 




i 




r 


h 


h 


h 




h 


r3 


r 


i 






h 


h 


h 


/2 


r 


r3 


r2 


; 



A lattice diagram of its subgroups is 
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All proper subgroups are cyclic except [i, r^, fi, /2) and {/, r^, /3, /4). Each 2-element subgroup is isomorphic to Z2 ; (/, r, r^, P\ is isomor- 
phic to Z4 ; and {i, r^, f], fi] and [i, r^, fj, f4} are isomorphic to XZ2. 

7. One solution is to cite Exercise 3 at the end of Section 11.3. It can be directly applied to this problem. An induction proof of the problem at 
hand would be almost identical to the proof of the more general statement. 

(fi f2 •■■ /r)"' = •••f2"'fi"' by Exercies 3 of Section 1 1 .3 

= ?2 ?i since each transposition inverts itself. ■ 

9. Part I: That = k ! follows from Exercise 3 of Section 7.3. 

Part II: Let / be the function defined on |1, 2, n] by /(I) = 2, /(2) = 3, /(3) = 1, and f{j) = j for 4 < j<n; and let g be defined by 
g(l) = l,g(2) = 3,g(3) = 2,andgO) = j for4< j<n. Note that /and g are elements of 5„. Next, (/og) (1) = /(g(l)) = /(I) = 2, while 
(go/)(i) = g(/(l)) = g(2) = 3, hence /og go/ and S„ is non-abelian for any n > 3. 

11. (a) Both groups are non-abelian and of order 6; so they must be isomorphic, since only one such group exists up to isomorphism. The 
function : 53 -> R3 defined by 

e{i) = I 9(/i)=Fi 

e{n) = Ri 0(f2) = F2 
e(r2) = R2 e{h) = F, 

is an isomorphism, 

(b) Recall that since every function is a relation, it is natural to translate functions to Boolean matrices. Suppose that / e S„. We will define its 
image, e(/), by 

e(/)A:/=l « fU) = k 

That 6 is a bijection follows from the existence of 9"' . If A is a rook matrix, 

'(^)0) = ^ <ts> The 1 in column j of A appears in row ^ 
« Aj^j = 1 

For/, g e S,„ 

0(f-8)tj = 1 « (f-g)ij) = k 

<ts> 3 / such that g{j) = I and/(/) = k 

« 3 / such that 0(g) ij = 1 and 0(/)^, = 1 

« m)eig)),j =1 

Therefore, 9 is an isomorphism. ■ 
Section 15.4 

l.(a) Yes, the kernel is {1, -1) 

(b) No, since 62(2 +5 4) = (92(1) = 1, but 02(2) +2 02(4) =0+20 =0 

(c) Yes, the kernel is I (a, -a)\ a eIR) 
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(d) No 

3. (r) = {i, r, r- , r'] is a normal subgroup of Di^. To see you could use the table given in the solution of Exercise 5 of Section 15.3 and verify 
that a ' /! a e (r> for all a e Z)4 and h&{r). A more efficient approach is to prove the general theorem that if // is a subgroup G with exactly 
two distinct left cosets, than H is normal. 

</i> is not a normal subgroup of D4. </i> = {;, fi) and if we choose a = r and h=fi then a"' ha= r'fir = f2i (/i> 
5. (ySo a) (ai, 02, 03) = and so P°a is the trivial homomorphism, but a homomorphism nevertheless. 
7. Let X, y & G. 

q{x *y) = {x* yf 

= x*y *x*y 

= X * x*y *y since Gis abelian 

2 2 
= x *y 

= q{x)*q{y) 
Hence, ^ is a homomorphism. 

In order for ^ to be an isomorphism, it must be the case that no element other than the identity is its own inverse. 

X e Ker (q) ^ q{x) = e 
<^ X * X = e 
« x~^ = X 

9. Proof: Recall: The inverse image of H' under d is 

e-\H') = {g^G I 0(g) e//') 

Closure: Let gi g2 £ 6~'(^^')> then ^(gi), 6(^2) Since //' is a subgroup 

of G', 

e(gi)oe{g2) = e{gi*g2) ^ gi*g2&0-\H') 

Identity: By Theorem 15.4.2(a), e e Q-^iH'). 

Inverse: Let a e e-\H') . Then 6(0) e and by Theorem 15.4.2(b), 6(0)-^ = 6(0-^) e ' and so a"' e e-\H'). 
Section 15.5 

1 . (a) Error detected, since an odd number of Is was received; ask for retransmission. 

(b) No error detected; accept this block. 

(c) No error detected; accept this block. 

3. (a) Syndrome = (1, 0, 1). Corrected message = (1, 1, 0). 

(b) Syndrome = (1, 1, 0). Corrected message =(0, 0, 1). 

(c) Syndrome (0, 0, 0). Corrected message = received message. 

= (0, 1, 1) 

(d) Syndrome = (1, 1, 0). Corrected message = (1, 0, 0). 

(e) Syndrome = (1 , 1 , 1). This syndrome occurs only if two bits have been switched. No reliable correction is possible. 

5. Let G be the 9x10 matrix obtained by augmenting the 9x9 identity matrix with a column of ones. The function e : Z2' -> Z2"' defined by 
e(a) = aG will allow us to detect single errors, since e (a) wiU always have an even number of ones. 

Supplementary Exercises— Chapter 15 

1. Theorem 15.1.3 guarantees that all subgroups of any cyclic group can be determined by finding all cyclic subgroups. We can find all cyclic 

subgroups of noncyclic groups but there may be other subgroups. 

3. First, write 120 as a product of powers of distinct primes: 120 = 2^ -3 -5. The Chinese Remainder Theorem states that 
0: Z120 -»ZgXZ3XZ5 defined by 0(^) = (A: mod 8, A: mod 3, A: mod 5) is an isomorphism. In particular, 0(74) = (2, 2, 4) and 
0(85) = (5, 1,0). Therefore, 
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9(14 +120 85) = 6(74) + (9(85) 

= (2, 2, 4) + (5, 1,0) 
= (7, 0, 4) 

Since 6(105) = (1,0, 0), and 6 (96) = (0, 0, 1), we can compute 

e-\7, 0, 4) = 7 Xi2o 105 +120 4 X120 96 . 
= 39 

5. H=0 + H = {0, 4, 8]=4+H = 8+H 

1 + H = {I, 5, 9] = 5 + H = 9 + H 

2 + H = [2, \Q) = 6 + H = XO + H 

3 + H = {3, 7, n} = 1 + H = n + H 

The operation table for this factor group is the same as that of [Z4, +4] with k replaced with k+ H. 
7. (a) |Zg| = 8 and 1(2)1 = 4, therefore there are 2 distinct left cosets, and they are: 

+ <2> = {0, 2, 4, 6} = 2 + <2> = 4 + (2) = 6 + <2> 

1 + <2> = {1, 3, 5, 7} = 3 + <2> = 5 + <2> = 7 + <2> 

(b) IZ12I = 12 and 1(2)1 = 6, therefore there are 2 distinct left cosets and they are: 

+ (2) = (0, 2, 4, 6, 8, 10) = 2 + (2) = 4 + (2) - 6 + (2) = 8 + (2) = 10 + (2) 
and 1 + (2) = |1, 3, 5, 7, 9, 1 1) = 3 + (2) = 5 + (2) = 7 + (2) = 9 + (2) = 11 + (2) 

(c) Since both groups are of order 2 and there is only one group of order 2 up to isomorphism, they are isomorphic. A simpler group is Z2. 
7. Assume /is even, f = ti°t2°---°t2r for some r, where each f, is a transposition. Hence 

= (fi°f2°---°f2r)~' = f2r°---°f2°fi by Exercise 11 of Section 15.3. 
Since the alternative, that/is odd, leads to being odd, /is even if and only if /"' is even. 

11. (a) This following is the "standard definition" of a Boolean algebra homomorphism. 
f :B\ -» B2 is a Boolean algebra homomorphism if and only if for all a, fo, e Bi . 

(1) f{aKb)=f{a)Kf{b) 

(2) /(aV*)=/(a)V/(/') 

(3) fia) =J(d) 

(b) (i) /(O) = /(a A 5) 

= f(a) A /(a) 

= fia) A m 

= 

and 

/(I) = fia V a) 

= fia) V/(a) 

= fia) V m 

= 1 

Note : The and 1 of Bi may be different than those of B2. 

{n) a <, b =^ a = a /\b by Supplementary Exercise 4 of Chapter 13 
=> fia)=fiaAb)=fia)Afib) 
=> fia) <, fib) by the same exercise cited above. 

(iii) See the solution to Exercise 15 of the Supplementary section of Chapter 13 for the definition of Boolean subalgebra. Part (i) of this 
exercise shows that /(Bj) contains the and 1 of B2. The definition in part a shows that /(a) e /(Bi) has a complement, namely 
fia) e /(Bi) , and also that /(Bi) must be closed with respect to both A and V. For example, if a, b e Bi, then a A b e Bi, and since 

fia) A fib) = / (a A b), fia) Afih) e/(Bi). 
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I' 1 1 1 
10 10 1 
1 1 1 

,0 1 1 1 1 ; 

(b) e(llll) = 1111111 ande(lOOl) = 1001001 

(c) (i) Syndrome = 101 => Error in second bit, since 101 is the second row of P. 

Corrected message = 0000. 

(ii) Syndrome = 000 => No error in transmission. Correct message is 1010. 

(iii) Syndrome = 001 => Error in seventh bit, since 001 is the seventh row of P. 

Corrected message = 1011. (Since the error was in a parity bit, the actual message is not corrected.) 

(d) The most direct way of proving that all single errors can be corrected is to compute the syndromes of each of the seven possible one- 
bit errors. Since each of them produces a distinct syndrome (the rows of P), single errors can always be corrected. 

CHAPTER 16 
Section 16.1 

I . All but rings c and e are commutative. All of the rings have a unity element. The number 1 is the unity for all of the rings except c and e. The 
unity for M2x2(K) is the two by two identity matrix; the unity for M„xn(K) is the n by n identity matrix. The units are as follows: 

(a) (1, -1) 

(b) C 

(c) {A I |A| = ±1} 

(d) Q' 

(e) {A I A11A22-A12A21 +0} 

(f) {1} 

3. Hints: (a) Consider commutativity 

(b) Solve = 3 X in both rings. 

5. (a) We already know that 3 Z is a subgroup of the group Z; so part 1 of Theorem 16.1 .1 is satisfied. We need only show that part 2 of the 
theorem holds: Let 3 m, 3 n e 3 Z. 

(3m)(3n) = 3(3 mn) e 3 Z, since 3 mn eZ. ■ 

(b) The proper subrings are {0, 2, 4, 6) and {0, 4); while (0) and Zg are improper subrings. 

(c) The proper subrings are {00, 01), |00, 10), and (00, 11): while {00} and Z2 XZ2 are improper subrings. 

7. (a) The left-hand side of the equation factors into the product [x - 2) (x - 3). Since Z is an integral domain, x = 2 and x = 3 are the only 
possible solutions. 

(b) Over Z12, 2, 3, 6, and 1 1 are solutions. Although the equation factors into (x - 2) (x - 3), this product can be zero without making x either 2 
or 3. For example. If x = 6 we get (6 - 2)Xi2(6 - 3) = 4 Xi23 = 0. Notice that 4 and 3 are divisors of zero. 

9. Leti?i,if2>andi?3 be any rings, then 

(a) Ri is isomorphic to if 1 and so "is isomorphic to" is a reflexive relation on rings, 

(b) i?i is isomorphic to R2 => R2 is isomorphic to if 1 , and so "is isomorphic to" is a symmetric relation on rings, 

(c) if 1 is isomorphic to if2, and if2 is isomorphic to R3 implies that ifi is isomorphic to if3, and so "is isomorphic to" is a transitive relation on 
rings. 

We haven't proven these properties here, just stated them. The combination of these observations implies that "is isomorphic to" is an 

equivalence relation on rings, 

I I . (a) Commutativity is clear from examination of a multiplication table for Z2 x Z3. More generally, we could prove a theorem that the direct 
product oftwoormore commutative rings is commutative. (1, 1) is the unity of Z2X Z3. 

(b) ((m, n) \ m = or n = 0, (m, n) * (0, 0)) 

(c) Another example is Z x Z. No, since by definition an integral domain D must contain the additive identity so we always have 
(m, 0) • (0, n) = (0, 0) inD X D. 
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13. (a) (a + b)(c + d) = (a + b)c + (a + h)d 

= ac + be + ad + bd 

(b) (a + b)(a + b) = aa + ba + ab + bb byparta 

= aa + ah + ab + bb since is commutative 

= a^ + lab + b^ 

15. Hint: The set of units of a ring is a group under multiplication. Apply a theorem from a group theory. 

17. Proof of Corollary to Theorem 6.1.4: Since p is a prime, all nonzero elements of Zp are relatively prime to p. By Theorem 16.1.4 we are 

done. 

Section 16.2 

3. No, since 2"' = 2 in Z3, but a"' ta and fo"' +b'mF. 
5. (a) (over Z2), 1 (over Z3), 3 (over Zj ) 

(b) 2 (overZs ), 3 (overZj) 

(c) 2 

7. (a) Oandl (b) 1 (c) 1 (d)none 

9. (c) The roots of - 2 = are ^^2 and -V2". Both numbers can be expressed in the form a +fcV2" where a, b & 
-VT = + 1 ■VTand-V2"= + -1 -VT. 

(d) No, since ±V3" caimot be expressed in the form a + b^fl ,a, e Q. If there exist rational numbers a and b such that 
Vs" = a + b V2", then clearly b + Q) since Vs" is irrational and a * for that would imply that V 3/2 is rational, which is false. If we 
square both sides, of the equation we wiU get a rational expression for V2~ which is also false. 

Section 16.3 

I. (i)/(jc) + g(x) = 2 + 2x + , f(x)g(x) =1 +2x +2x'^+x^ 

(ii) f(x) + g(x) = x', f (X) g(x) = l+x^ 

(iii) 1 + 3x + 4x^ + 3x^ + x'^ 

(iv) I + X + x^ + x'' 

(v) ^ + x^ 

3. (a) If a, b e IR,a - and a are in R since R is a ring in its own right. Therefore, R is a subring of RM. The proofs of parts b and c are 
similar. 

5. (a) Reducible, (x + \)(x^ + x + 1) 

(b) Reducible, x{x^ + x + 1) 

(c) Irreducible. If you could factor this polynomial, one factor would be either x or x + 1, which would give you a root of or 1, 
respectively. By substitution of and 1 into this polynomial, it clearly has no roots. 

(d) Reducible, {x + 1)'' 

7. We illustrate this property of polynomials by showing that it is not true for a nonprime polynomial in Z2M. Suppose that p(x) = + \, 

which can be reduced to {x + if- ,a (x) = x^ + x, and b (x) = x^ + x^. Since a (x) b{x) = x^ + x^ = x^(x^ + 1), p(x) \ a(x) b(x). However, p (x) 
is not a factor of either a(x) or b (x). 

9. The only possible proper factors of x - 3 are (x - V3~) and (x + Vs"), which are not in Q[x] but are in R[x]. 

I I . For n > 0, let S(n) be the proposition: For all g{x) tO and / (x) with deg / (x) = n, there exist unique polynomials q (x) and r (x) such that 
fix) = g(x) q(x) + r(x), and either r(x) = or deg r (x) < degg (x). 

Basis: S (0) is true, for if / (x) has degree 0, it is a nonzero constant, /(x) = ci=0, and so either /(x) = g(x) • + c if g(x) is not a constant, or 

/ (x) = g(x) g(x)"' + if g(x) is also a constant. 

Induction: Assume that for some n>0, S{k) is true for all ^ < n, If /(x) has degree n + 1, then there are two cases to consider. If 
degg(x) > n + l,/(x) = g(x)0 + /(x), and we are done. Otherwise, if deg g(x) =m < n + 1, we perform long division as follows, where 
LDT's = various terms of lower degree than n + 1 . 
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n+1 8m ^ 



1 yrt+l-m 



g„x"+LDTs )/„+! J<:"+' + LDT'i 
Uix"^' + LDTS 
h{x) 

Therefore, 
h{x) = fix) - (/„+i -g,,,-' ^ 

Since deg h (x) is less than n + 1 , we can apply the induction hypothesis: 

h{x) = g(x)q(x) + r(x) with degr(x) < degg(x). 
Therefore, 

fix) = 8ix)if„+i-g„-^ x"*^-" + qix)) + rix) with degrW < deggW. 

This establishes the existence of a quotient and remainder. The uniqueness of q ix) and r ix) as stated in the theorem is proven as follows: if 
/ ix) is also equal to gix) q(x) + fix) with deg r(x) < deg gix), then 

six) qix) + rix) = gix) qix) +~rix) => g{x) iqix) - qix)) = rix) -rix) 

Since degr(x) - rix) < deg gix), the degree of both sides of the last equation is less than deg gix). Therefore, it must be that 
qix) - qix) = 0, or qix) = qix) And so rix) = fix). ■ 

Section 16.4 



1. If Co + ai V2" e q|^V2"] is nonzero, then it has a multiplicative inverse: 

1 1 oo-ai VT 



ao -2a! a{)~-la\ 



The denominator, qq^ - 2a\^ ,ii nonzero since is irrational. Since — - and are both rational numbers, gq -\- a\ ^fl is a unit of 

aQ--2a\- ao-—2ai 

Q[V2~j. The field containing Q[V2"j is denoted q(V2~) and so q(V2") = q[V2~] 

3. x"* - + 6 = (x^ - 2) - 3) has zeros ± V2~ and ± V3~. Q(V2~j = l^a + b VT | a, ^) e q| contains the zeros + V2" but does 
not contain ± , since neither are expressible in the form a + h V2~ . If we consider the set then this field 

contains ± Vs" as well as ± Vz", an is denoted |(Q|V2^jj (VT j = q|VT, VT j. Taking into accoimt the form of c and d in the description 

above, we can expand to 

q(V2~, VT) = jfoo + ^'1 VT + ^2 VT + ^3 VT I fo, e q}. 



5. (a) / (x) = x' + X + 1 is reducible if and only if it has a factor of the form x - a. By Theorem 16.3.3, x - a is a factor if and only if a is a 
zero. Neither nor 1 is a zero of / (x) over Z2. 

(b) Since / (x) is irreducible over Z2, all zeros of / (x) must lie in an extension field of Z2 . Let c be a zero of / (x). Z2(c) can be described 
several different ways. One way is to note that since c e Z2(c), c" e Z2(c) for all n. Therefore, Z2(c) includes 0, c, c^, c^, .... But = c + I 
since /(c) = 0. Furthermore, c'^ = cP- + c, c' = + c + 1 , = + 1, and = 1. Higher powers of c repeat preceding powers. Therefore, 

Z2(c)= {0, 1, C, , c + 1, + 1, + c + I, + c}. 

= [ag + ai c + 02 I e Z2) 

The three zeros of fix) are c, and + c. 

fix) = (x + c)(x+ c^)ix + + c). 

(c) Cite Theorem 16.2.4, part 3. 

Section 16.5 

3. Theorem 16.5.2 proves that not all nonzero elements in F[[x]] are units. 
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7. (a) bo = 1 

foi=(-l)(2-l) = -2 

&2 = (-l)(2-(-2) + 4-l)= 
b3= (-l)(2-0 + 4-(-2) + 8-l)=0 
. . . (all others are zero) 

Hence, /(x)"' = l-2x 

(b) f(x) = I + 2x + 2^x^+2^:)^+ ■■■ 

= {2xf + {2x)^ + {2xf + {2xf + ■■■ 
_ 1 

~ l-2x 

The last step follows from the formula for the sum of a geometric series. 

9. (a) (x" - 2 + x^y^ = (x^ ix^-2x+ 1))"' 
= x-^(l -2x + x2)-' 

= x-2 |(yt+l)x*] by Example 2 of 16.5 
\k=o ) 

oo 

= Z (k + 2)x^ 

k=-2 

Supplementary Exercises— Chapter 16 

1 . (a) This ring is not commutative. 

(A + B)2= (A + B)-(A + B) 

= (A + B)-A + (A + 5)-B 
= A-A + B-A + A-B +BB 
= A^+ BA + A-B + B^ 

(b) Yes 

3. (a) By Theorem 16.1.1 show: 

(1) [D +] is a subgroup of the group LA^2x2(IR); +]. We leave this to the reader. 

I a \ I c \ 

(2) D is closed imder multiplication. To prove this, let I q ^ I' I q d I^^' 

( a \( c \ (ac \ 

lo b][o d]=[o bd}^" 

since a c and & are real numbers and the product is in the form of a typical matrix in D. 
(b) Since 



la OWc 0\_lac \ _ / c \l a 0\ 
[o b)[o dj~[o bdj~[o d j[o bj' 

( 1 0\ 

D IS commutative. The umty for Z) is I ^ ^ I. 

(c) The product of two nonzero matrices can be equal to zero. For example, o)(o ^)~(o o)' ^ divisors of zero and 

by Theorem 16.1 .2 the cancellation law is not true in D. 
5. (a) 2* = 16 

(b) The product cited in the solution to 3(c) above shows that M2x2(IR) has divisors of zero. Therefore, the matrix polynomial 
(x - I)ix + 1) may have solutions other then ±7. If fact you can verify that ^ ^ j j and ^ | ^ j satisfy the given equation. 



. Use r : A ^ R defined by q ) ) ^ 
9. By substitution and the operation tables of Example 16.2.2, 
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+ a + 1 = h + a + I 
=1+1=0 

Therefore, a is a root. A similar calculation shows that hisa root. Substitution of and 1 for x shows that they are not root. 

11. By Theorem 16.3.3, a e Q is a zero of f{x) iff (x - a) is a factor of f(x), which also implies a must be a factor of 9. Hence, the only 
possible rational roots are: ±1, ±3, and ±9. We can verify that (x - 3) is a divisor of f{x) or that x = 3 is a zero of f (x). Dividing /(a) by 
(x - 3) produces qix) =x^ -3x^ + x - 3, which has x = 3 as a rational root. Dividing q(x) by x-3 produces x^ + 1. Hence, the complete 
factorization of /(x) in Q[x] is (x - 3)^(x^+ 1). 
13. 8(0) = 0,8(1) = 1, 

8(a) = + (p- + a = 1 +b + a = l + l=0, and 

gih) = + b = 1 + a + b =1 + 1 = 0. 

Hence, 0, a, and b are zeros of g (x) and the g(x) = x(x -a)(x- b) = x(x + a)(x + b). 

15. (a) Sum = (1,0, 1), Product = (0, 1, 1, 1) 

(b) Sum = (1,0, 0, 0), Product = (0, 1, 1, 1, 0, 0, 1) 

(c) Sum = (1, 1, 1, 0, 0), Product = (0, 0, 0, 0, 1, 1, 1, 0, 1) 

(d) Sum = 010, Product = 11011 

16. The encoding of a string of bits is based on polynomial division. Given a four bit message, we make the bits coefficients of a sixth degree 
polynomial, b^x^ + b^x'* + b^x^ + b^x^ which we can also express in as (0,0,0, b^, b4, b^, b^), we divide this polynomial by 
p(x) = 1 + X + x^ and add the remainder to the "message polynomial. The quotient is in the division is discarded. Thus, if the remainder, 
which must be a polynomial of degree less than 2, is 60 + bix + b2X^, the encoded message is the string of bits (bo, bi, b2, b^, b^, b^, b^). 

(a) Encode the following elements of Z2*as described above. 

(a) (0, 0, 0, 1, 1, 0, 1) 

(b) (0, 0, 0, 1, 1, 1, 1) 

(c) (0, 0, 0, 0, 0, 1, 0) 

(b) Prove that the encoded message will always represent a polynomial with is evenly divisible by the polynomial p(x) that is used to 
encode the message. 

17. If the message polynomial is m(x) = b^x^ + b^x^ + b^x? + b(,x^ divide by p(x)= 1 + x + x^ and get a quotient and remainder: 
m(x) = p(x)q(x) + r(x), where the degree of r(x) is less than 3. We transmit t(x) = m(x) + r(x) = m(x) + (m(x) - p(x)q(x)) = p(x)q(x) 
since m(x) + m(x) = 0. Now assume that the error x* is added and we receive p(x) q(x) + . Since x*^, < ^ < 6, is not a multiple of p(x), the 
received polynomial is also not a multiple of p(x). The following Mathematica calculation verifies this last claim. 

({x", PolynomialRemalnder [x", + x + 1, x. Modulus ^ 2] ]■ & /@Range[0, 6] ) // 
Prepend[tt, {"Monomial", "Remainder"}] & 

( Monomial Remainder ^ 



1 


1 


X 


X 






x2 


x2 


x^ 


X + 1 


X* 


X^ + X 




X^ + X + 1 


x" 


x2 + 1 



19. (a) bix) = x^ + x" + 1 = ^(x) (x^ + X + 1) + ^ a = 111 

(b) b(x) = X* + x' + x^ + 1 = g(x) x^ + 1 
=> error in the first bit of 

e(a) = 001 101 
=> a = 001 

Getting a from e(a) involves doing this calculation: 
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PolynomialQuotient [x^ + x^+x^, x^+ x + 1, x. Modulus -» 2] 

(c) b{x) = + x + I = g{x) (x^ + \) + x^ 

^ error in the third bit of b 
=)e(a) = 111001 
^ a = 101 

PolynomialQuotient [x^ + x^ + x + l, x''+ x + 1, x. Modulus -» 2] 

x^ + 1 

(d) b(x) = x'^ + x^+x+\ = g{x) (x+\) + x^ + X 

=> error in the fifth bit of b 

=> e(a) = 110100 (the string representation of 

=> a = 100 

21. (a) g(x) is irreducible over Z2 since g(0) = g(l) = 1. Hence, g(x) does not split in Z2. Let /3 be a zero of g(x), so that 
^2[j8] = [a + b p + c a, b, c e Z.2}. This is a field of 2^ = 8 elements which, by Theorem 16.2.4, is isomorphic to GF(8). 

23. l/g(x) = f(x) of Example 16.5.2. 

1 + 2;c + 3x^ + 4x^ + ■■■ 

i-2x+x^ n 

1 -2x + 2x^ 

2x-4x^ + 2x^ 
■3x^-2x^ 
ix^ -6x^ +2x'^ 
Ax^ -Zx'^ 
Ax^ - %x'^ +4x^ 
Sx'^-Ax^ 

25. (a) Co = oi = 1 , 02 = 2, 03 = 3, 04 = 5, , so 

fix) = 1 + X + 2x- + 3x3 + 5x'* + ... ^ 
(b) flo = oi = 1, 02 = 0> 03 = 1, 04 = 1, 05 = 0, ... . 

g (x) = 1 + X + + + x** + + + + • • • 
= (1 + x) + x^{\ + x) + x*(l + x) + ••• 
= (1 +x)(l +x^ + x'' + ■■■) 

(i+.t) 
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